diff --git a/.github/README_TEMPLATE.md b/.github/README_TEMPLATE.md index d04a0299536242a6d4c0743242c6616eaac40a97..43dba40f59684df0f79faa341c8de67916313210 100644 --- a/.github/README_TEMPLATE.md +++ b/.github/README_TEMPLATE.md @@ -1,13 +1,13 @@ > :memo: A README.md template for releasing a paper code implementation to a GitHub repository. > -> * Template version: 1.0.2020.125 +> * Template version: 1.0.2020.170 > * Please modify sections depending on needs. # Model name, Paper title, or Project Name > :memo: Add a badge for the ArXiv identifier of your paper (arXiv:YYMM.NNNNN) -[![Paper](http://img.shields.io/badge/paper-arXiv.YYMM.NNNNN-B3181B.svg)](https://arxiv.org/abs/...) +[![Paper](http://img.shields.io/badge/Paper-arXiv.YYMM.NNNNN-B3181B?logo=arXiv)](https://arxiv.org/abs/...) This repository is the official or unofficial implementation of the following paper. @@ -28,8 +28,8 @@ This repository is the official or unofficial implementation of the following pa > :memo: Provide maintainer information. 
-* Last name, First name ([@GitHub username](https://github.com/username)) -* Last name, First name ([@GitHub username](https://github.com/username)) +* Full name ([@GitHub username](https://github.com/username)) +* Full name ([@GitHub username](https://github.com/username)) ## Table of Contents @@ -37,8 +37,8 @@ This repository is the official or unofficial implementation of the following pa ## Requirements -[![TensorFlow 2.1](https://img.shields.io/badge/tensorflow-2.1-brightgreen)](https://github.com/tensorflow/tensorflow/releases/tag/v2.1.0) -[![Python 3.6](https://img.shields.io/badge/python-3.6-blue.svg)](https://www.python.org/downloads/release/python-360/) +[![TensorFlow 2.1](https://img.shields.io/badge/TensorFlow-2.1-FF6F00?logo=tensorflow)](https://github.com/tensorflow/tensorflow/releases/tag/v2.1.0) +[![Python 3.6](https://img.shields.io/badge/Python-3.6-3776AB)](https://www.python.org/downloads/release/python-360/) > :memo: Provide details of the software required. > @@ -54,6 +54,8 @@ pip install -r requirements.txt ## Results +[![TensorFlow Hub](https://img.shields.io/badge/TF%20Hub-Models-FF6F00?logo=tensorflow)](https://tfhub.dev/...) + > :memo: Provide a table with results. (e.g., accuracy, latency) > > * Provide links to the pre-trained models (checkpoint, SavedModel files). @@ -104,6 +106,8 @@ python3 ... ## License +[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0) + > :memo: Place your license text in a file named LICENSE in the root of the repository. > > * Include information about your license. 
diff --git a/README.md b/README.md index d9dbe91ddcf56cda804b3b46e7a03fd0faa54d6f..203051feb7acbf3f6501d5c29516841958bedb75 100644 --- a/README.md +++ b/README.md @@ -2,28 +2,34 @@ # Welcome to the Model Garden for TensorFlow -The TensorFlow Model Garden is a repository with a number of different implementations of state-of-the-art (SOTA) models and modeling solutions for TensorFlow users. We aim to demonstrate the best practices for modeling so that TensorFlow users can take full advantage of TensorFlow for their research and product development. +The TensorFlow Model Garden is a repository with a number of different implementations of state-of-the-art (SOTA) models and modeling solutions for TensorFlow users. We aim to demonstrate the best practices for modeling so that TensorFlow users +can take full advantage of TensorFlow for their research and product development. | Directory | Description | |-----------|-------------| | [official](official) | • A collection of example implementations for SOTA models using the latest TensorFlow 2's high-level APIs
• Officially maintained, supported, and kept up to date with the latest TensorFlow 2 APIs by TensorFlow
• Reasonably optimized for fast performance while still being easy to read | | [research](research) | • A collection of research model implementations in TensorFlow 1 or 2 by researchers
• Maintained and supported by researchers | | [community](community) | • A curated list of the GitHub repositories with machine learning models and implementations powered by TensorFlow 2 | +| [orbit](orbit) | • A flexible and lightweight library that users can easily use or fork when writing customized training loop code in TensorFlow 2.x. It seamlessly integrates with `tf.distribute` and supports running on different device types (CPU, GPU, and TPU). | -## [Announcements](../../wiki/Announcements) +## [Announcements](https://github.com/tensorflow/models/wiki/Announcements) | Date | News | |------|------| -| May 21, 2020 | [Unifying Deep Local and Global Features for Image Search (DELG)](https://github.com/tensorflow/models/tree/master/research/delf#delg) code released -| May 7, 2020 | [MnasFPN with MobileNet-V2 backbone](https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/detection_model_zoo.md#mobile-models) released for object detection -| May 1, 2020 | [DELF: DEep Local Features](https://github.com/tensorflow/models/tree/master/research/delf) updated to support TensorFlow 2.1 +| July 10, 2020 | TensorFlow 2 meets the [Object Detection API](https://github.com/tensorflow/models/tree/master/research/object_detection) ([Blog](https://blog.tensorflow.org/2020/07/tensorflow-2-meets-object-detection-api.html)) | +| June 30, 2020 | [SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization](https://github.com/tensorflow/models/tree/master/official/vision/detection#train-a-spinenet-49-based-mask-r-cnn) released ([Tweet](https://twitter.com/GoogleAI/status/1278016712978264064)) | +| June 17, 2020 | [Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection](https://github.com/tensorflow/models/tree/master/research/object_detection#june-17th-2020) released ([Tweet](https://twitter.com/GoogleAI/status/1276571419422253057)) | +| May 21, 2020 | [Unifying Deep Local and Global Features for Image Search 
(DELG)](https://github.com/tensorflow/models/tree/master/research/delf#delg) code released | +| May 19, 2020 | [MobileDets: Searching for Object Detection Architectures for Mobile Accelerators](https://github.com/tensorflow/models/tree/master/research/object_detection#may-19th-2020) released | +| May 7, 2020 | [MnasFPN with MobileNet-V2 backbone](https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/detection_model_zoo.md#mobile-models) released for object detection | +| May 1, 2020 | [DELF: DEep Local Features](https://github.com/tensorflow/models/tree/master/research/delf) updated to support TensorFlow 2.1 | | March 31, 2020 | [Introducing the Model Garden for TensorFlow 2](https://blog.tensorflow.org/2020/03/introducing-model-garden-for-tensorflow-2.html) ([Tweet](https://twitter.com/TensorFlow/status/1245029834633297921)) | ## Contributions [![help wanted:paper implementation](https://img.shields.io/github/issues/tensorflow/models/help%20wanted%3Apaper%20implementation)](https://github.com/tensorflow/models/labels/help%20wanted%3Apaper%20implementation) -If you want to contribute, please review the [contribution guidelines](../../wiki/How-to-contribute). +If you want to contribute, please review the [contribution guidelines](https://github.com/tensorflow/models/wiki/How-to-contribute). ## License diff --git a/community/README.md b/community/README.md index eea11fc2b63fede7b983e7e3aa9390400be22c6b..ed01dfbed07bca73b321336d59fd5d174545f6cd 100644 --- a/community/README.md +++ b/community/README.md @@ -6,13 +6,12 @@ This repository provides a curated list of the GitHub repositories with machine **Note**: Contributing companies or individuals are responsible for maintaining their repositories. 
-## Models / Implementations +## Computer Vision -### Computer Vision +### Image Recognition -#### Image Recognition -| Model | Reference (Paper) | Features | Maintainer | -|-------|-------------------|----------|------------| +| Model | Paper | Features | Maintainer | +|-------|-------|----------|------------| | [DenseNet 169](https://github.com/IntelAI/models/tree/master/benchmarks/image_recognition/tensorflow/densenet169) | [Densely Connected Convolutional Networks](https://arxiv.org/pdf/1608.06993) | • FP32 Inference | [Intel](https://github.com/IntelAI) | | [Inception V3](https://github.com/IntelAI/models/tree/master/benchmarks/image_recognition/tensorflow/inceptionv3) | [Rethinking the Inception Architecture
for Computer Vision](https://arxiv.org/pdf/1512.00567.pdf) | • Int8 Inference
• FP32 Inference | [Intel](https://github.com/IntelAI) | | [Inception V4](https://github.com/IntelAI/models/tree/master/benchmarks/image_recognition/tensorflow/inceptionv4) | [Inception-v4, Inception-ResNet and the Impact
of Residual Connections on Learning](https://arxiv.org/pdf/1602.07261) | • Int8 Inference
• FP32 Inference | [Intel](https://github.com/IntelAI) | @@ -21,12 +20,21 @@ This repository provides a curated list of the GitHub repositories with machine | [ResNet 50](https://github.com/IntelAI/models/tree/master/benchmarks/image_recognition/tensorflow/resnet50) | [Deep Residual Learning for Image Recognition](https://arxiv.org/pdf/1512.03385) | • Int8 Inference
• FP32 Inference | [Intel](https://github.com/IntelAI) | | [ResNet 50v1.5](https://github.com/IntelAI/models/tree/master/benchmarks/image_recognition/tensorflow/resnet50v1_5) | [Deep Residual Learning for Image Recognition](https://arxiv.org/pdf/1512.03385) | • Int8 Inference
• FP32 Inference
• FP32 Training | [Intel](https://github.com/IntelAI) | -#### Segmentation -| Model | Reference (Paper) |       Features       | Maintainer | -|-------|-------------------|----------|------------| +### Object Detection + +| Model | Paper | Features | Maintainer | +|-------|-------|----------|------------| +| [R-FCN](https://github.com/IntelAI/models/tree/master/benchmarks/object_detection/tensorflow/rfcn) | [R-FCN: Object Detection
via Region-based Fully Convolutional Networks](https://arxiv.org/pdf/1605.06409) | • Int8 Inference
• FP32 Inference | [Intel](https://github.com/IntelAI) | +| [SSD-MobileNet](https://github.com/IntelAI/models/tree/master/benchmarks/object_detection/tensorflow/ssd-mobilenet) | [MobileNets: Efficient Convolutional Neural Networks
for Mobile Vision Applications](https://arxiv.org/pdf/1704.04861) | • Int8 Inference
• FP32 Inference | [Intel](https://github.com/IntelAI) | +| [SSD-ResNet34](https://github.com/IntelAI/models/tree/master/benchmarks/object_detection/tensorflow/ssd-resnet34) | [SSD: Single Shot MultiBox Detector](https://arxiv.org/pdf/1512.02325) | • Int8 Inference
• FP32 Inference
• FP32 Training | [Intel](https://github.com/IntelAI) | + +### Segmentation + +| Model | Paper | Features | Maintainer | +|-------|-------|----------|------------| | [Mask R-CNN](https://github.com/NVIDIA/DeepLearningExamples/tree/master/TensorFlow2/Segmentation/MaskRCNN) | [Mask R-CNN](https://arxiv.org/abs/1703.06870) | • Automatic Mixed Precision
• Multi-GPU training support with Horovod
• TensorRT | [NVIDIA](https://github.com/NVIDIA) | | [U-Net Medical Image Segmentation](https://github.com/NVIDIA/DeepLearningExamples/tree/master/TensorFlow2/Segmentation/UNet_Medical) | [U-Net: Convolutional Networks for Biomedical Image Segmentation](https://arxiv.org/abs/1505.04597) | • Automatic Mixed Precision
• Multi-GPU training support with Horovod
• TensorRT | [NVIDIA](https://github.com/NVIDIA) | ## Contributions -If you want to contribute, please review the [contribution guidelines](../../../wiki/How-to-contribute). +If you want to contribute, please review the [contribution guidelines](https://github.com/tensorflow/models/wiki/How-to-contribute). diff --git a/official/README.md b/official/README.md index 84fd2e6342f9d7ce9a74fc2c7a3518fa5b7efd17..77e43ea9c15e9a18cfee3fb757016cf5091d0c28 100644 --- a/official/README.md +++ b/official/README.md @@ -17,11 +17,9 @@ with the same or improved speed and performance with each new TensorFlow build. The team is actively developing new models. In the near future, we will add: -* State-of-the-art language understanding models: - More members in Transformer family -* Start-of-the-art image classification models: - EfficientNet, MnasNet, and variants -* A set of excellent objection detection models. +* State-of-the-art language understanding models. +* State-of-the-art image classification models. +* State-of-the-art object detection and instance segmentation models. 
## Table of Contents @@ -43,6 +41,7 @@ In the near future, we will add: |-------|-------------------| | [MNIST](vision/image_classification) | A basic model to classify digits from the [MNIST dataset](http://yann.lecun.com/exdb/mnist/) | | [ResNet](vision/image_classification) | [Deep Residual Learning for Image Recognition](https://arxiv.org/abs/1512.03385) | +| [EfficientNet](vision/image_classification) | [EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks](https://arxiv.org/abs/1905.11946) | #### Object Detection and Segmentation @@ -50,6 +49,8 @@ In the near future, we will add: |-------|-------------------| | [RetinaNet](vision/detection) | [Focal Loss for Dense Object Detection](https://arxiv.org/abs/1708.02002) | | [Mask R-CNN](vision/detection) | [Mask R-CNN](https://arxiv.org/abs/1703.06870) | +| [ShapeMask](vision/detection) | [ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors](https://arxiv.org/abs/1904.03239) | +| [SpineNet](vision/detection) | [SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization](https://arxiv.org/abs/1912.05027) | ### Natural Language Processing diff --git a/official/benchmark/bert_pretrain_benchmark.py b/official/benchmark/bert_pretrain_benchmark.py index d63c894847d8e9e9308523d3efcb06c162d323c6..be14b34b588980036267c9cf29f94475f538304e 100644 --- a/official/benchmark/bert_pretrain_benchmark.py +++ b/official/benchmark/bert_pretrain_benchmark.py @@ -144,6 +144,39 @@ class BertPretrainAccuracyBenchmark(bert_benchmark_utils.BertBenchmarkBase): self._run_and_report_benchmark(summary_path=summary_path, report_accuracy=True) + @owner_utils.Owner('tf-model-garden') + def benchmark_perf_2x2_tpu_bf16_seq128_10k_steps(self): + """Test bert pretraining with 2x2 TPU for 10000 steps.""" + self._setup() + self._specify_common_flags() + FLAGS.num_steps_per_epoch = 5000 + FLAGS.num_train_epochs = 2 + FLAGS.train_batch_size = 128 + FLAGS.model_dir = self._get_model_dir( + 
'benchmark_perf_2x2_tpu_bf16_seq128_10k_steps') + summary_path = os.path.join(FLAGS.model_dir, + 'summaries/training_summary.txt') + # Disable accuracy check. + self._run_and_report_benchmark( + summary_path=summary_path, report_accuracy=False) + + @owner_utils.Owner('tf-model-garden') + def benchmark_perf_2x2_tpu_bf16_seq128_10k_steps_mlir(self): + """Test bert pretraining with 2x2 TPU with MLIR for 10000 steps.""" + self._setup() + self._specify_common_flags() + FLAGS.num_steps_per_epoch = 5000 + FLAGS.num_train_epochs = 2 + FLAGS.train_batch_size = 128 + FLAGS.model_dir = self._get_model_dir( + 'benchmark_perf_2x2_tpu_bf16_seq128_10k_steps_mlir') + summary_path = os.path.join(FLAGS.model_dir, + 'summaries/training_summary.txt') + tf.config.experimental.enable_mlir_bridge() + # Disable accuracy check. + self._run_and_report_benchmark( + summary_path=summary_path, report_accuracy=False) + @owner_utils.Owner('tf-model-garden') def benchmark_perf_4x4_tpu_bf16_seq128_10k_steps(self): """Test bert pretraining with 4x4 TPU for 10000 steps.""" @@ -159,6 +192,22 @@ class BertPretrainAccuracyBenchmark(bert_benchmark_utils.BertBenchmarkBase): self._run_and_report_benchmark( summary_path=summary_path, report_accuracy=False) + @owner_utils.Owner('tf-model-garden') + def benchmark_perf_4x4_tpu_bf16_seq128_10k_steps_mlir(self): + """Test bert pretraining with 4x4 TPU with MLIR for 10000 steps.""" + self._setup() + self._specify_common_flags() + FLAGS.num_steps_per_epoch = 5000 + FLAGS.num_train_epochs = 2 + FLAGS.model_dir = self._get_model_dir( + 'benchmark_perf_4x4_tpu_bf16_seq128_10k_steps_mlir') + summary_path = os.path.join(FLAGS.model_dir, + 'summaries/training_summary.txt') + tf.config.experimental.enable_mlir_bridge() + # Disable accuracy check. 
+ self._run_and_report_benchmark( + summary_path=summary_path, report_accuracy=False) + @owner_utils.Owner('tf-model-garden') def benchmark_perf_8x8_tpu_bf16_seq128_10k_steps(self): """Test bert pretraining with 8x8 TPU for 10000 steps.""" diff --git a/official/benchmark/keras_imagenet_benchmark.py b/official/benchmark/keras_imagenet_benchmark.py index 63a48dfb1222b65311652e3bee4241854a55043e..9dfcede08b64f6670b010d389c554a2be9dac035 100644 --- a/official/benchmark/keras_imagenet_benchmark.py +++ b/official/benchmark/keras_imagenet_benchmark.py @@ -299,20 +299,21 @@ class MobilenetV1KerasAccuracy(keras_benchmark.KerasBenchmark): return os.path.join(self.output_dir, folder_name) -class Resnet50KerasClassifierBenchmarkBase(keras_benchmark.KerasBenchmark): - """Resnet50 (classifier_trainer) benchmarks.""" +class KerasClassifierBenchmarkBase(keras_benchmark.KerasBenchmark): + """Classifier Trainer benchmarks.""" - def __init__(self, output_dir=None, default_flags=None, + def __init__(self, model, output_dir=None, default_flags=None, tpu=None, dataset_builder='records', train_epochs=1, train_steps=110, data_dir=None): flag_methods = [classifier_trainer.define_classifier_flags] + self.model = model self.dataset_builder = dataset_builder self.train_epochs = train_epochs self.train_steps = train_steps self.data_dir = data_dir - super(Resnet50KerasClassifierBenchmarkBase, self).__init__( + super(KerasClassifierBenchmarkBase, self).__init__( output_dir=output_dir, flag_methods=flag_methods, default_flags=default_flags, @@ -337,7 +338,7 @@ class Resnet50KerasClassifierBenchmarkBase(keras_benchmark.KerasBenchmark): dataset_num_private_threads: Optional[int] = None, loss_scale: Optional[str] = None): """Runs and reports the benchmark given the provided configuration.""" - FLAGS.model_type = 'resnet' + FLAGS.model_type = self.model FLAGS.dataset = 'imagenet' FLAGS.mode = 'train_and_eval' FLAGS.data_dir = self.data_dir @@ -372,7 +373,7 @@ class 
Resnet50KerasClassifierBenchmarkBase(keras_benchmark.KerasBenchmark): # input skip_steps. warmup = (skip_steps or (self.train_steps - 100)) // FLAGS.log_steps - super(Resnet50KerasClassifierBenchmarkBase, self)._report_benchmark( + super(KerasClassifierBenchmarkBase, self)._report_benchmark( stats, wall_time_sec, total_batch_size=total_batch_size, @@ -599,8 +600,7 @@ class Resnet50KerasClassifierBenchmarkBase(keras_benchmark.KerasBenchmark): distribution_strategy='mirrored', per_replica_batch_size=256, gpu_thread_mode='gpu_private', - dataset_num_private_threads=48, - steps=310) + dataset_num_private_threads=48) def benchmark_xla_8_gpu_fp16_dynamic_tweaked(self): """Tests Keras model with config tuning, XLA, 8 GPUs and dynamic fp16.""" @@ -636,6 +636,28 @@ class Resnet50KerasClassifierBenchmarkBase(keras_benchmark.KerasBenchmark): distribution_strategy='tpu', per_replica_batch_size=128) + def benchmark_2x2_tpu_bf16_mlir(self): + """Test Keras model with 2x2 TPU, bf16, and MLIR enabled.""" + self._setup() + tf.config.experimental.enable_mlir_bridge() + self._run_and_report_benchmark( + experiment_name='benchmark_2x2_tpu_bf16_mlir', + dtype='bfloat16', + num_tpus=8, + distribution_strategy='tpu', + per_replica_batch_size=128) + + def benchmark_4x4_tpu_bf16_mlir(self): + """Test Keras model with 4x4 TPU, bf16, and MLIR enabled.""" + self._setup() + tf.config.experimental.enable_mlir_bridge() + self._run_and_report_benchmark( + experiment_name='benchmark_4x4_tpu_bf16_mlir', + dtype='bfloat16', + num_tpus=32, + distribution_strategy='tpu', + per_replica_batch_size=128) + def benchmark_8x8_tpu_bf16(self): """Test Keras model with 8x8 TPU, bf16.""" self._setup() @@ -647,7 +669,7 @@ class Resnet50KerasClassifierBenchmarkBase(keras_benchmark.KerasBenchmark): per_replica_batch_size=64) def fill_report_object(self, stats): - super(Resnet50KerasClassifierBenchmarkBase, self).fill_report_object( + super(KerasClassifierBenchmarkBase, self).fill_report_object( stats, total_batch_size=FLAGS.batch_size, 
log_steps=FLAGS.log_steps) @@ -1086,7 +1108,7 @@ class Resnet50KerasBenchmarkBase(keras_benchmark.KerasBenchmark): log_steps=FLAGS.log_steps) -class Resnet50KerasBenchmarkSynth(Resnet50KerasClassifierBenchmarkBase): +class Resnet50KerasBenchmarkSynth(KerasClassifierBenchmarkBase): """Resnet50 synthetic benchmark tests.""" def __init__(self, output_dir=None, root_data_dir=None, tpu=None, **kwargs): @@ -1094,11 +1116,11 @@ class Resnet50KerasBenchmarkSynth(Resnet50KerasClassifierBenchmarkBase): def_flags['log_steps'] = 10 super(Resnet50KerasBenchmarkSynth, self).__init__( - output_dir=output_dir, default_flags=def_flags, tpu=tpu, + model='resnet', output_dir=output_dir, default_flags=def_flags, tpu=tpu, dataset_builder='synthetic', train_epochs=1, train_steps=110) -class Resnet50KerasBenchmarkReal(Resnet50KerasClassifierBenchmarkBase): +class Resnet50KerasBenchmarkReal(KerasClassifierBenchmarkBase): """Resnet50 real data benchmark tests.""" def __init__(self, output_dir=None, root_data_dir=None, tpu=None, **kwargs): @@ -1107,11 +1129,25 @@ class Resnet50KerasBenchmarkReal(Resnet50KerasClassifierBenchmarkBase): def_flags['log_steps'] = 10 super(Resnet50KerasBenchmarkReal, self).__init__( - output_dir=output_dir, default_flags=def_flags, tpu=tpu, + model='resnet', output_dir=output_dir, default_flags=def_flags, tpu=tpu, dataset_builder='records', train_epochs=1, train_steps=110, data_dir=data_dir) +class EfficientNetKerasBenchmarkReal(KerasClassifierBenchmarkBase): + """EfficientNet real data benchmark tests.""" + + def __init__(self, output_dir=None, root_data_dir=None, tpu=None, **kwargs): + data_dir = os.path.join(root_data_dir, 'imagenet') + def_flags = {} + def_flags['log_steps'] = 10 + + super(EfficientNetKerasBenchmarkReal, self).__init__( + model='efficientnet', output_dir=output_dir, default_flags=def_flags, + tpu=tpu, dataset_builder='records', train_epochs=1, train_steps=110, + data_dir=data_dir) + + class 
Resnet50KerasBenchmarkRemoteData(Resnet50KerasBenchmarkBase): """Resnet50 real data (stored in remote storage) benchmark tests.""" diff --git a/official/benchmark/resnet_ctl_imagenet_benchmark.py b/official/benchmark/resnet_ctl_imagenet_benchmark.py index 0e70e8da969ec9b02a2de00d1973bdd2aa5f2b51..f4a7f4bd5e797965d880900324d2969dbc0130ba 100644 --- a/official/benchmark/resnet_ctl_imagenet_benchmark.py +++ b/official/benchmark/resnet_ctl_imagenet_benchmark.py @@ -38,13 +38,18 @@ FLAGS = flags.FLAGS class CtlBenchmark(PerfZeroBenchmark): """Base benchmark class with methods to simplify testing.""" - def __init__(self, output_dir=None, default_flags=None, flag_methods=None): + def __init__(self, + output_dir=None, + default_flags=None, + flag_methods=None, + **kwargs): self.default_flags = default_flags or {} self.flag_methods = flag_methods or {} super(CtlBenchmark, self).__init__( output_dir=output_dir, default_flags=self.default_flags, - flag_methods=self.flag_methods) + flag_methods=self.flag_methods, + **kwargs) def _report_benchmark(self, stats, @@ -190,13 +195,14 @@ class Resnet50CtlAccuracy(CtlBenchmark): class Resnet50CtlBenchmarkBase(CtlBenchmark): """Resnet50 benchmarks.""" - def __init__(self, output_dir=None, default_flags=None): + def __init__(self, output_dir=None, default_flags=None, **kwargs): flag_methods = [common.define_keras_flags] super(Resnet50CtlBenchmarkBase, self).__init__( output_dir=output_dir, flag_methods=flag_methods, - default_flags=default_flags) + default_flags=default_flags, + **kwargs) @benchmark_wrappers.enable_runtime_flags def _run_and_report_benchmark(self): @@ -381,12 +387,24 @@ class Resnet50CtlBenchmarkBase(CtlBenchmark): FLAGS.single_l2_loss_op = True FLAGS.use_tf_function = True FLAGS.enable_checkpoint_and_export = False + FLAGS.data_dir = 'gs://mlcompass-data/imagenet/imagenet-2012-tfrecord' def benchmark_2x2_tpu_bf16(self): self._setup() self._set_df_common() FLAGS.batch_size = 1024 FLAGS.dtype = 'bf16' + FLAGS.model_dir = 
self._get_model_dir('benchmark_2x2_tpu_bf16') + self._run_and_report_benchmark() + + @owner_utils.Owner('tf-graph-compiler') + def benchmark_2x2_tpu_bf16_mlir(self): + self._setup() + self._set_df_common() + FLAGS.batch_size = 1024 + FLAGS.dtype = 'bf16' + tf.config.experimental.enable_mlir_bridge() + FLAGS.model_dir = self._get_model_dir('benchmark_2x2_tpu_bf16_mlir') self._run_and_report_benchmark() def benchmark_4x4_tpu_bf16(self): @@ -394,6 +412,7 @@ class Resnet50CtlBenchmarkBase(CtlBenchmark): self._set_df_common() FLAGS.batch_size = 4096 FLAGS.dtype = 'bf16' + FLAGS.model_dir = self._get_model_dir('benchmark_4x4_tpu_bf16') self._run_and_report_benchmark() @owner_utils.Owner('tf-graph-compiler') @@ -403,6 +422,7 @@ class Resnet50CtlBenchmarkBase(CtlBenchmark): self._set_df_common() FLAGS.batch_size = 4096 FLAGS.dtype = 'bf16' + FLAGS.model_dir = self._get_model_dir('benchmark_4x4_tpu_bf16_mlir') tf.config.experimental.enable_mlir_bridge() self._run_and_report_benchmark() @@ -426,11 +446,11 @@ class Resnet50CtlBenchmarkSynth(Resnet50CtlBenchmarkBase): def_flags['skip_eval'] = True def_flags['use_synthetic_data'] = True def_flags['train_steps'] = 110 - def_flags['steps_per_loop'] = 20 + def_flags['steps_per_loop'] = 10 def_flags['log_steps'] = 10 super(Resnet50CtlBenchmarkSynth, self).__init__( - output_dir=output_dir, default_flags=def_flags) + output_dir=output_dir, default_flags=def_flags, **kwargs) class Resnet50CtlBenchmarkReal(Resnet50CtlBenchmarkBase): @@ -441,11 +461,11 @@ class Resnet50CtlBenchmarkReal(Resnet50CtlBenchmarkBase): def_flags['skip_eval'] = True def_flags['data_dir'] = os.path.join(root_data_dir, 'imagenet') def_flags['train_steps'] = 110 - def_flags['steps_per_loop'] = 20 + def_flags['steps_per_loop'] = 10 def_flags['log_steps'] = 10 super(Resnet50CtlBenchmarkReal, self).__init__( - output_dir=output_dir, default_flags=def_flags) + output_dir=output_dir, default_flags=def_flags, **kwargs) if __name__ == '__main__': diff --git 
a/official/benchmark/retinanet_benchmark.py b/official/benchmark/retinanet_benchmark.py index 62bc80eef1fd00d5087af5522561ff7cf7863f5e..3b87fd21294ac1aa9334579b31b861f77e32399c 100644 --- a/official/benchmark/retinanet_benchmark.py +++ b/official/benchmark/retinanet_benchmark.py @@ -44,11 +44,11 @@ RESNET_CHECKPOINT_PATH = 'gs://cloud-tpu-checkpoints/retinanet/resnet50-checkpoi # pylint: enable=line-too-long -class DetectionBenchmarkBase(perfzero_benchmark.PerfZeroBenchmark): +class BenchmarkBase(perfzero_benchmark.PerfZeroBenchmark): """Base class to hold methods common to test classes.""" def __init__(self, **kwargs): - super(DetectionBenchmarkBase, self).__init__(**kwargs) + super(BenchmarkBase, self).__init__(**kwargs) self.timer_callback = None def _report_benchmark(self, stats, start_time_sec, wall_time_sec, min_ap, @@ -99,7 +99,7 @@ class DetectionBenchmarkBase(perfzero_benchmark.PerfZeroBenchmark): extras={'flags': flags_str}) -class RetinanetBenchmarkBase(DetectionBenchmarkBase): +class DetectionBenchmarkBase(BenchmarkBase): """Base class to hold methods common to test classes in the module.""" def __init__(self, **kwargs): @@ -107,7 +107,7 @@ class RetinanetBenchmarkBase(DetectionBenchmarkBase): self.eval_data_path = COCO_EVAL_DATA self.eval_json_path = COCO_EVAL_JSON self.resnet_checkpoint_path = RESNET_CHECKPOINT_PATH - super(RetinanetBenchmarkBase, self).__init__(**kwargs) + super(DetectionBenchmarkBase, self).__init__(**kwargs) def _run_detection_main(self): """Starts detection job.""" @@ -118,7 +118,7 @@ class RetinanetBenchmarkBase(DetectionBenchmarkBase): return detection.run() -class RetinanetAccuracy(RetinanetBenchmarkBase): +class DetectionAccuracy(DetectionBenchmarkBase): """Accuracy test for RetinaNet model. Tests RetinaNet detection task model accuracy. The naming @@ -126,6 +126,10 @@ class RetinanetAccuracy(RetinanetBenchmarkBase): `benchmark_(number of gpus)_gpu_(dataset type)` format. 
""" + def __init__(self, model, **kwargs): + self.model = model + super(DetectionAccuracy, self).__init__(**kwargs) + @benchmark_wrappers.enable_runtime_flags def _run_and_report_benchmark(self, params, @@ -133,7 +137,7 @@ class RetinanetAccuracy(RetinanetBenchmarkBase): max_ap=0.35, do_eval=True, warmup=1): - """Starts RetinaNet accuracy benchmark test.""" + """Starts Detection accuracy benchmark test.""" FLAGS.params_override = json.dumps(params) # Need timer callback to measure performance self.timer_callback = keras_utils.TimeHistory( @@ -156,8 +160,8 @@ class RetinanetAccuracy(RetinanetBenchmarkBase): max_ap, warmup) def _setup(self): - super(RetinanetAccuracy, self)._setup() - FLAGS.model = 'retinanet' + super(DetectionAccuracy, self)._setup() + FLAGS.model = self.model def _params(self): return { @@ -195,22 +199,22 @@ class RetinanetAccuracy(RetinanetBenchmarkBase): self._run_and_report_benchmark(params) -class RetinanetBenchmarkReal(RetinanetAccuracy): - """Short benchmark performance tests for RetinaNet model. +class DetectionBenchmarkReal(DetectionAccuracy): + """Short benchmark performance tests for a detection model. - Tests RetinaNet performance in different GPU configurations. + Tests detection performance in different accelerator configurations. The naming convention of below test cases follow `benchmark_(number of gpus)_gpu` format. """ def _setup(self): - super(RetinanetBenchmarkReal, self)._setup() + super(DetectionBenchmarkReal, self)._setup() # Use negative value to avoid saving checkpoints. 
FLAGS.save_checkpoint_freq = -1 @flagsaver.flagsaver def benchmark_8_gpu_coco(self): - """Run RetinaNet model accuracy test with 8 GPUs.""" + """Run detection model accuracy test with 8 GPUs.""" self._setup() params = self._params() params['architecture']['use_bfloat16'] = False @@ -230,7 +234,7 @@ class RetinanetBenchmarkReal(RetinanetAccuracy): @flagsaver.flagsaver def benchmark_1_gpu_coco(self): - """Run RetinaNet model accuracy test with 1 GPU.""" + """Run detection model accuracy test with 1 GPU.""" self._setup() params = self._params() params['architecture']['use_bfloat16'] = False @@ -245,7 +249,7 @@ class RetinanetBenchmarkReal(RetinanetAccuracy): @flagsaver.flagsaver def benchmark_xla_1_gpu_coco(self): - """Run RetinaNet model accuracy test with 1 GPU and XLA enabled.""" + """Run detection model accuracy test with 1 GPU and XLA enabled.""" self._setup() params = self._params() params['architecture']['use_bfloat16'] = False @@ -261,7 +265,7 @@ class RetinanetBenchmarkReal(RetinanetAccuracy): @flagsaver.flagsaver def benchmark_2x2_tpu_coco(self): - """Run RetinaNet model accuracy test with 4 TPUs.""" + """Run detection model accuracy test with 4 TPUs.""" self._setup() params = self._params() params['train']['batch_size'] = 64 @@ -271,6 +275,88 @@ class RetinanetBenchmarkReal(RetinanetAccuracy): FLAGS.strategy_type = 'tpu' self._run_and_report_benchmark(params, do_eval=False, warmup=0) + @flagsaver.flagsaver + def benchmark_4x4_tpu_coco(self): + """Run detection model accuracy test with 4x4 TPU.""" + self._setup() + params = self._params() + params['train']['batch_size'] = 256 + params['train']['total_steps'] = 469 # One epoch. 
+ params['train']['iterations_per_loop'] = 500 + FLAGS.model_dir = self._get_model_dir('real_benchmark_4x4_tpu_coco') + FLAGS.strategy_type = 'tpu' + self._run_and_report_benchmark(params, do_eval=False, warmup=0) + + @flagsaver.flagsaver + def benchmark_2x2_tpu_coco_mlir(self): + """Run detection model accuracy test with 4 TPUs and MLIR enabled.""" + self._setup() + params = self._params() + params['train']['batch_size'] = 64 + params['train']['total_steps'] = 1875 # One epoch. + params['train']['iterations_per_loop'] = 500 + FLAGS.model_dir = self._get_model_dir('real_benchmark_2x2_tpu_coco_mlir') + FLAGS.strategy_type = 'tpu' + tf.config.experimental.enable_mlir_bridge() + self._run_and_report_benchmark(params, do_eval=False, warmup=0) + + @flagsaver.flagsaver + def benchmark_4x4_tpu_coco_mlir(self): + """Run detection model accuracy test with 4x4 TPU and MLIR enabled.""" + self._setup() + params = self._params() + params['train']['batch_size'] = 256 + params['train']['total_steps'] = 469 # One epoch. + params['train']['iterations_per_loop'] = 500 + FLAGS.model_dir = self._get_model_dir('real_benchmark_4x4_tpu_coco_mlir') + FLAGS.strategy_type = 'tpu' + tf.config.experimental.enable_mlir_bridge() + self._run_and_report_benchmark(params, do_eval=False, warmup=0) + + @flagsaver.flagsaver + def benchmark_2x2_tpu_spinenet_coco(self): + """Run detection model with SpineNet backbone accuracy test with 4 TPUs.""" + self._setup() + params = self._params() + params['architecture']['backbone'] = 'spinenet' + params['architecture']['multilevel_features'] = 'identity' + params['architecture']['use_bfloat16'] = False + params['train']['batch_size'] = 64 + params['train']['total_steps'] = 1875 # One epoch. 
+ params['train']['iterations_per_loop'] = 500 + params['train']['checkpoint']['path'] = '' + FLAGS.model_dir = self._get_model_dir( + 'real_benchmark_2x2_tpu_spinenet_coco') + FLAGS.strategy_type = 'tpu' + self._run_and_report_benchmark(params, do_eval=False, warmup=0) + + +class RetinanetBenchmarkReal(DetectionBenchmarkReal): + """Short benchmark performance tests for Retinanet model.""" + + def __init__(self, **kwargs): + super(RetinanetBenchmarkReal, self).__init__( + model='retinanet', + **kwargs) + + +class MaskRCNNBenchmarkReal(DetectionBenchmarkReal): + """Short benchmark performance tests for Mask RCNN model.""" + + def __init__(self, **kwargs): + super(MaskRCNNBenchmarkReal, self).__init__( + model='mask_rcnn', + **kwargs) + + +class ShapeMaskBenchmarkReal(DetectionBenchmarkReal): + """Short benchmark performance tests for ShapeMask model.""" + + def __init__(self, **kwargs): + super(ShapeMaskBenchmarkReal, self).__init__( + model='shapemask', + **kwargs) + if __name__ == '__main__': tf.test.main() diff --git a/official/benchmark/transformer_benchmark.py b/official/benchmark/transformer_benchmark.py index e61201aa174af4882c6dbab28e10fe64d8cc1377..597b9465c81875ca28c276676146b1aec04c4674 100644 --- a/official/benchmark/transformer_benchmark.py +++ b/official/benchmark/transformer_benchmark.py @@ -29,6 +29,8 @@ from official.nlp.transformer import misc from official.nlp.transformer import transformer_main as transformer_main from official.utils.flags import core as flags_core +TPU_DATA_DIR = 'gs://mlcompass-data/transformer' +GPU_DATA_DIR = os.getenv('TMPDIR') TRANSFORMER_EN2DE_DATA_DIR_NAME = 'wmt32k-en2de-official' EN2DE_2014_BLEU_DATA_DIR_NAME = 'newstest2014' FLAGS = flags.FLAGS @@ -40,37 +42,54 @@ class TransformerBenchmark(PerfZeroBenchmark): Code under test for the Transformer Keras models report the same data and require the same FLAG setup. 
+ """ def __init__(self, output_dir=None, default_flags=None, root_data_dir=None, flag_methods=None, tpu=None): + self._set_data_files(root_data_dir=root_data_dir) + + if default_flags is None: + default_flags = {} + default_flags['data_dir'] = self.train_data_dir + default_flags['vocab_file'] = self.vocab_file + + super(TransformerBenchmark, self).__init__( + output_dir=output_dir, + default_flags=default_flags, + flag_methods=flag_methods, + tpu=tpu) + + def _set_data_files(self, root_data_dir=None, tpu_run=False): + """Sets train_data_dir, vocab_file, bleu_source and bleu_ref.""" + # Use remote storage for TPU, remote storage for GPU if defined, else + # use environment provided root_data_dir. + if tpu_run: + root_data_dir = TPU_DATA_DIR + elif GPU_DATA_DIR is not None: + root_data_dir = GPU_DATA_DIR + root_data_dir = root_data_dir if root_data_dir else '' self.train_data_dir = os.path.join(root_data_dir, TRANSFORMER_EN2DE_DATA_DIR_NAME) - self.vocab_file = os.path.join(root_data_dir, TRANSFORMER_EN2DE_DATA_DIR_NAME, 'vocab.ende.32768') - self.bleu_source = os.path.join(root_data_dir, EN2DE_2014_BLEU_DATA_DIR_NAME, 'newstest2014.en') - self.bleu_ref = os.path.join(root_data_dir, EN2DE_2014_BLEU_DATA_DIR_NAME, 'newstest2014.de') - if default_flags is None: - default_flags = {} - default_flags['data_dir'] = self.train_data_dir - default_flags['vocab_file'] = self.vocab_file - - super(TransformerBenchmark, self).__init__( - output_dir=output_dir, - default_flags=default_flags, - flag_methods=flag_methods, - tpu=tpu) + def _set_data_file_flags(self): + """Sets the FLAGS for the data files.""" + FLAGS.data_dir = self.train_data_dir + FLAGS.vocab_file = self.vocab_file + # Sets values directly to avoid validation check. 
+ FLAGS['bleu_source'].value = self.bleu_source + FLAGS['bleu_ref'].value = self.bleu_ref @benchmark_wrappers.enable_runtime_flags def _run_and_report_benchmark(self, @@ -164,12 +183,8 @@ class TransformerBaseKerasAccuracy(TransformerBenchmark): not converge to the 27.3 BLEU (uncased) SOTA. """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 1 - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'base' FLAGS.batch_size = 2048 FLAGS.train_steps = 1000 @@ -189,12 +204,8 @@ class TransformerBaseKerasAccuracy(TransformerBenchmark): not converge to the 27.3 BLEU (uncased) SOTA. """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 1 - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'base' FLAGS.batch_size = 4096 FLAGS.train_steps = 100000 @@ -215,12 +226,8 @@ class TransformerBaseKerasAccuracy(TransformerBenchmark): Should converge to 27.3 BLEU (uncased). This has not been confirmed yet. """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'base' FLAGS.batch_size = 4096*8 FLAGS.train_steps = 100000 @@ -237,12 +244,8 @@ class TransformerBaseKerasAccuracy(TransformerBenchmark): Should converge to 27.3 BLEU (uncased). This has not been confirmed yet. """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. 
- FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'base' FLAGS.batch_size = 4096*8 FLAGS.train_steps = 100000 @@ -284,12 +287,8 @@ class TransformerBigKerasAccuracy(TransformerBenchmark): Iterations are not epochs, an iteration is a number of steps between evals. """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'big' FLAGS.batch_size = 3072*8 FLAGS.train_steps = 20000 * 12 @@ -306,12 +305,8 @@ class TransformerBigKerasAccuracy(TransformerBenchmark): Should converge to 28.4 BLEU (uncased). This has not be verified yet." """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'big' FLAGS.batch_size = 3072*8 FLAGS.static_batch = True @@ -337,13 +332,9 @@ class TransformerBigKerasAccuracy(TransformerBenchmark): not epochs, an iteration is a number of steps between evals. """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 FLAGS.dtype = 'fp16' - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'big' FLAGS.batch_size = 3072*8 FLAGS.train_steps = 20000 * 12 @@ -360,14 +351,10 @@ class TransformerBigKerasAccuracy(TransformerBenchmark): Should converge to 28.4 BLEU (uncased). This has not be verified yet." 
""" self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 FLAGS.dtype = 'fp16' FLAGS.fp16_implementation = 'graph_rewrite' - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'big' FLAGS.batch_size = 3072*8 FLAGS.train_steps = 20000 * 12 @@ -384,13 +371,9 @@ class TransformerBigKerasAccuracy(TransformerBenchmark): Should converge to 28.4 BLEU (uncased). This has not be verified yet." """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 FLAGS.dtype = 'fp16' - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. - FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'big' FLAGS.batch_size = 3072*8 FLAGS.static_batch = True @@ -409,14 +392,10 @@ class TransformerBigKerasAccuracy(TransformerBenchmark): Should converge to 28.4 BLEU (uncased). This has not be verified yet." """ self._setup() + self._set_data_file_flags() FLAGS.num_gpus = 8 FLAGS.dtype = 'fp16' FLAGS.enable_xla = True - FLAGS.data_dir = self.train_data_dir - FLAGS.vocab_file = self.vocab_file - # Sets values directly to avoid validation check. 
- FLAGS['bleu_source'].value = self.bleu_source - FLAGS['bleu_ref'].value = self.bleu_ref FLAGS.param_set = 'big' FLAGS.batch_size = 3072*8 FLAGS.static_batch = True @@ -687,22 +666,41 @@ class TransformerBigKerasBenchmarkReal(TransformerKerasBenchmark): root_data_dir=root_data_dir, batch_per_gpu=3072, tpu=tpu) - def benchmark_2x2_tpu(self): - """Port of former snaggletooth transformer_big model on 2x2.""" - self._setup() - FLAGS.model_dir = self._get_model_dir('benchmark_2x2_tpu') + def _set_df_common(self): + self._set_data_files(tpu_run=True) + FLAGS.data_dir = self.train_data_dir + FLAGS.vocab_file = self.vocab_file + FLAGS.distribution_strategy = 'tpu' + FLAGS.padded_decode = True FLAGS.train_steps = 300 FLAGS.log_steps = 150 FLAGS.steps_between_evals = 150 - FLAGS.distribution_strategy = 'tpu' FLAGS.static_batch = True FLAGS.use_ctl = True - FLAGS.batch_size = 6144 + FLAGS.enable_checkpointing = False FLAGS.max_length = 64 FLAGS.decode_batch_size = 32 FLAGS.decode_max_length = 97 - FLAGS.padded_decode = True - FLAGS.enable_checkpointing = False + + def benchmark_2x2_tpu(self): + """Port of former snaggletooth transformer_big model on 2x2.""" + self._setup() + self._set_df_common() + FLAGS.model_dir = self._get_model_dir('benchmark_2x2_tpu') + FLAGS.batch_size = 6144 + + self._run_and_report_benchmark( + total_batch_size=FLAGS.batch_size, + log_steps=FLAGS.log_steps) + + @owner_utils.Owner('tf-graph-compiler') + def benchmark_2x2_tpu_mlir(self): + """Run transformer_big model on 2x2 with the MLIR Bridge enabled.""" + self._setup() + self._set_df_common() + FLAGS.model_dir = self._get_model_dir('benchmark_2x2_tpu_mlir') + FLAGS.batch_size = 6144 + tf.config.experimental.enable_mlir_bridge() self._run_and_report_benchmark( total_batch_size=FLAGS.batch_size, @@ -711,19 +709,9 @@ class TransformerBigKerasBenchmarkReal(TransformerKerasBenchmark): def benchmark_4x4_tpu(self): """Port of former GCP transformer_big model on 4x4.""" self._setup() + 
self._set_df_common() FLAGS.model_dir = self._get_model_dir('benchmark_4x4_tpu') - FLAGS.train_steps = 300 - FLAGS.log_steps = 150 - FLAGS.steps_between_evals = 150 - FLAGS.distribution_strategy = 'tpu' - FLAGS.static_batch = True - FLAGS.use_ctl = True FLAGS.batch_size = 24576 - FLAGS.max_length = 64 - FLAGS.decode_batch_size = 32 - FLAGS.decode_max_length = 97 - FLAGS.padded_decode = True - FLAGS.enable_checkpointing = False self._run_and_report_benchmark( total_batch_size=FLAGS.batch_size, @@ -733,19 +721,9 @@ class TransformerBigKerasBenchmarkReal(TransformerKerasBenchmark): def benchmark_4x4_tpu_mlir(self): """Run transformer_big model on 4x4 with the MLIR Bridge enabled.""" self._setup() - FLAGS.model_dir = self._get_model_dir('benchmark_4x4_tpu') - FLAGS.train_steps = 300 - FLAGS.log_steps = 150 - FLAGS.steps_between_evals = 150 - FLAGS.distribution_strategy = 'tpu' - FLAGS.static_batch = True - FLAGS.use_ctl = True + self._set_df_common() + FLAGS.model_dir = self._get_model_dir('benchmark_4x4_tpu_mlir') FLAGS.batch_size = 24576 - FLAGS.max_length = 64 - FLAGS.decode_batch_size = 32 - FLAGS.decode_max_length = 97 - FLAGS.padded_decode = True - FLAGS.enable_checkpointing = False tf.config.experimental.enable_mlir_bridge() self._run_and_report_benchmark( diff --git a/official/benchmark/unet3d_benchmark.py b/official/benchmark/unet3d_benchmark.py index 2614b29259dcf4c85d609abca94706c95570b7ec..8c811e483e4d1935487f1175baf6f5786632c952 100644 --- a/official/benchmark/unet3d_benchmark.py +++ b/official/benchmark/unet3d_benchmark.py @@ -93,8 +93,11 @@ class Unet3DAccuracyBenchmark(keras_benchmark.KerasBenchmark): """Runs and reports the benchmark given the provided configuration.""" params = unet_training_lib.extract_params(FLAGS) strategy = unet_training_lib.create_distribution_strategy(params) - if params.use_bfloat16: - policy = tf.keras.mixed_precision.experimental.Policy('mixed_bfloat16') + + input_dtype = params.dtype + if input_dtype == 'float16' or 
input_dtype == 'bfloat16': + policy = tf.keras.mixed_precision.experimental.Policy( + 'mixed_bfloat16' if input_dtype == 'bfloat16' else 'mixed_float16') tf.keras.mixed_precision.experimental.set_policy(policy) stats = {} diff --git a/official/colab/fine_tuning_bert.ipynb b/official/colab/fine_tuning_bert.ipynb index 443674b6b9f1292d25f26cc06e3359506763bfce..b63c9a3f6d7912c61eee0a948406c8934061e88c 100644 --- a/official/colab/fine_tuning_bert.ipynb +++ b/official/colab/fine_tuning_bert.ipynb @@ -12,7 +12,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "cellView": "form", "colab": {}, @@ -104,7 +104,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -128,7 +128,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -185,7 +185,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -204,12 +204,12 @@ "id": "9uFskufsR2LT" }, "source": [ - "You can get a pre-trained BERT encoder from TensorFlow Hub here:" + "You can get a pre-trained BERT encoder from [TensorFlow Hub](https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/2):" ] }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -252,7 +252,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -267,7 +267,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -290,7 +290,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -313,7 +313,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, 
"metadata": { "colab": {}, "colab_type": "code", @@ -336,7 +336,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -376,7 +376,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -404,7 +404,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -446,7 +446,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -469,7 +469,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -490,7 +490,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -514,7 +514,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -562,7 +562,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -587,7 +587,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -617,7 +617,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -661,7 +661,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -691,7 +691,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -737,7 +737,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -769,7 +769,7 @@ }, { "cell_type": "code", - 
"execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -793,7 +793,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -816,7 +816,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -845,7 +845,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -870,7 +870,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -908,7 +908,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -943,7 +943,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -986,7 +986,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1023,7 +1023,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1055,7 +1055,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1071,7 +1071,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1096,7 +1096,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1110,7 +1110,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1176,7 +1176,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": 
"code", @@ -1201,7 +1201,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1240,7 +1240,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1273,7 +1273,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1306,7 +1306,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1351,7 +1351,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1379,7 +1379,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1406,17 +1406,44 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", - "id": "lo6479At4sP1" + "id": "GDWrHm0BGpbX" }, "outputs": [], "source": [ "# Note: 350MB download.\n", - "import tensorflow_hub as hub\n", - "hub_encoder = hub.KerasLayer(hub_url_bert, trainable=True)\n", + "import tensorflow_hub as hub" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "cellView": "form", + "colab": {}, + "colab_type": "code", + "id": "Y29meH0qGq_5" + }, + "outputs": [], + "source": [ + "hub_model_name = \"bert_en_uncased_L-12_H-768_A-12\" #@param [\"bert_en_uncased_L-24_H-1024_A-16\", \"bert_en_wwm_cased_L-24_H-1024_A-16\", \"bert_en_uncased_L-12_H-768_A-12\", \"bert_en_wwm_uncased_L-24_H-1024_A-16\", \"bert_en_cased_L-24_H-1024_A-16\", \"bert_en_cased_L-12_H-768_A-12\", \"bert_zh_L-12_H-768_A-12\", \"bert_multi_cased_L-12_H-768_A-12\"]" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "lo6479At4sP1" + }, + 
"outputs": [], + "source": [ + "hub_encoder = hub.KerasLayer(f\"https://tfhub.dev/tensorflow/{hub_model_name}\",\n", + " trainable=True)\n", "\n", "print(f\"The Hub encoder has {len(hub_encoder.trainable_variables)} trainable variables\")" ] @@ -1433,7 +1460,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1466,7 +1493,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1491,7 +1518,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1504,7 +1531,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1545,7 +1572,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1569,7 +1596,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1592,7 +1619,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1617,7 +1644,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1643,7 +1670,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1661,7 +1688,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1688,7 +1715,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1714,7 +1741,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, 
"colab_type": "code", @@ -1733,7 +1760,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1761,7 +1788,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", @@ -1795,7 +1822,7 @@ }, { "cell_type": "code", - "execution_count": 0, + "execution_count": null, "metadata": { "colab": {}, "colab_type": "code", diff --git a/official/colab/nlp/customize_encoder.ipynb b/official/colab/nlp/customize_encoder.ipynb new file mode 100644 index 0000000000000000000000000000000000000000..18b45d3a66fcaab007d25c1d6db1cd461509daa2 --- /dev/null +++ b/official/colab/nlp/customize_encoder.ipynb @@ -0,0 +1,625 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "Bp8t2AI8i7uP" + }, + "source": [ + "##### Copyright 2020 The TensorFlow Authors." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "cellView": "form", + "colab": {}, + "colab_type": "code", + "id": "rxPj2Lsni9O4" + }, + "outputs": [], + "source": [ + "#@title Licensed under the Apache License, Version 2.0 (the \"License\");\n", + "# you may not use this file except in compliance with the License.\n", + "# You may obtain a copy of the License at\n", + "#\n", + "# https://www.apache.org/licenses/LICENSE-2.0\n", + "#\n", + "# Unless required by applicable law or agreed to in writing, software\n", + "# distributed under the License is distributed on an \"AS IS\" BASIS,\n", + "# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\n", + "# See the License for the specific language governing permissions and\n", + "# limitations under the License." 
+ ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "6xS-9i5DrRvO" + }, + "source": [ + "# Customizing a Transformer Encoder" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "Mwb9uw1cDXsa" + }, + "source": [ + "\u003ctable class=\"tfo-notebook-buttons\" align=\"left\"\u003e\n", + " \u003ctd\u003e\n", + " \u003ca target=\"_blank\" href=\"https://www.tensorflow.org/official_models/nlp/customize_encoder\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/tf_logo_32px.png\" /\u003eView on TensorFlow.org\u003c/a\u003e\n", + " \u003c/td\u003e\n", + " \u003ctd\u003e\n", + " \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/tensorflow/models/blob/master/official/colab/nlp/customize_encoder.ipynb\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/colab_logo_32px.png\" /\u003eRun in Google Colab\u003c/a\u003e\n", + " \u003c/td\u003e\n", + " \u003ctd\u003e\n", + " \u003ca target=\"_blank\" href=\"https://github.com/tensorflow/models/blob/master/official/colab/nlp/customize_encoder.ipynb\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/GitHub-Mark-32px.png\" /\u003eView source on GitHub\u003c/a\u003e\n", + " \u003c/td\u003e\n", + " \u003ctd\u003e\n", + " \u003ca href=\"https://storage.googleapis.com/tensorflow_docs/models/official/colab/nlp/customize_encoder.ipynb\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/download_logo_32px.png\" /\u003eDownload notebook\u003c/a\u003e\n", + " \u003c/td\u003e\n", + "\u003c/table\u003e" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "iLrcV4IyrcGX" + }, + "source": [ + "## Learning objectives\n", + "\n", + "The [TensorFlow Models NLP library](https://github.com/tensorflow/models/tree/master/official/nlp/modeling) is a collection of tools for building and training modern high performance natural language models.\n", + "\n", + "The 
[TransformerEncoder](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/networks/encoder_scaffold.py) is the core of this library, and many new network architectures have been proposed to improve the encoder. In this Colab notebook, we will learn how to customize the encoder to employ new network architectures." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "YYxdyoWgsl8t" + }, + "source": [ + "## Install and import" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "fEJSFutUsn_h" + }, + "source": [ + "### Install the TensorFlow Model Garden pip package\n", + "\n", + "* `tf-models-nightly` is the nightly Model Garden package created daily automatically.\n", + "* `pip` will install all models and dependencies automatically." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "thsKZDjhswhR" + }, + "outputs": [], + "source": [ + "!pip install -q tf-nightly\n", + "!pip install -q tf-models-nightly" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "hpf7JPCVsqtv" + }, + "source": [ + "### Import TensorFlow and other libraries" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "my4dp-RMssQe" + }, + "outputs": [], + "source": [ + "import numpy as np\n", + "import tensorflow as tf\n", + "\n", + "from official.modeling import activations\n", + "from official.nlp import modeling\n", + "from official.nlp.modeling import layers, losses, models, networks" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "vjDmVsFfs85n" + }, + "source": [ + "## Canonical BERT encoder\n", + "\n", + "Before learning how to customize the encoder, let's first create a canonical BERT encoder and use it to instantiate a `BertClassifier` for a classification task."
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "Oav8sbgstWc-" + }, + "outputs": [], + "source": [ + "cfg = {\n", + " \"vocab_size\": 100,\n", + " \"hidden_size\": 32,\n", + " \"num_layers\": 3,\n", + " \"num_attention_heads\": 4,\n", + " \"intermediate_size\": 64,\n", + " \"activation\": activations.gelu,\n", + " \"dropout_rate\": 0.1,\n", + " \"attention_dropout_rate\": 0.1,\n", + " \"sequence_length\": 16,\n", + " \"type_vocab_size\": 2,\n", + " \"initializer\": tf.keras.initializers.TruncatedNormal(stddev=0.02),\n", + "}\n", + "bert_encoder = modeling.networks.TransformerEncoder(**cfg)\n", + "\n", + "def build_classifier(bert_encoder):\n", + " return modeling.models.BertClassifier(bert_encoder, num_classes=2)\n", + "\n", + "canonical_classifier_model = build_classifier(bert_encoder)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "Qe2UWI6_tsHo" + }, + "source": [ + "`canonical_classifier_model` can be trained using the training data. For details about how to train the model, please see the colab [fine_tuning_bert.ipynb](https://github.com/tensorflow/models/blob/master/official/colab/fine_tuning_bert.ipynb). 
We skip the code that trains the model here.\n", + "\n", + "After training, we can use the model to make predictions.\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "csED2d-Yt5h6" + }, + "outputs": [], + "source": [ + "def predict(model):\n", + " batch_size = 3\n", + " np.random.seed(0)\n", + " word_ids = np.random.randint(\n", + " cfg[\"vocab_size\"], size=(batch_size, cfg[\"sequence_length\"]))\n", + " mask = np.random.randint(2, size=(batch_size, cfg[\"sequence_length\"]))\n", + " type_ids = np.random.randint(\n", + " cfg[\"type_vocab_size\"], size=(batch_size, cfg[\"sequence_length\"]))\n", + " print(model([word_ids, mask, type_ids], training=False))\n", + "\n", + "predict(canonical_classifier_model)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "PzKStEK9t_Pb" + }, + "source": [ + "## Customize BERT encoder\n", + "\n", + "A BERT encoder consists of an embedding network and multiple transformer blocks, and each transformer block contains an attention layer and a feedforward layer." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "rmwQfhj6fmKz" + }, + "source": [ + "We provide easy ways to customize each of those components via (1)\n", + "[EncoderScaffold](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/networks/encoder_scaffold.py) and (2) [TransformerScaffold](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/transformer_scaffold.py)." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "xsMgEVHAui11" + }, + "source": [ + "### Use EncoderScaffold\n", + "\n", + "`EncoderScaffold` allows users to provide a custom embedding subnetwork\n", + " (which will replace the standard embedding logic) and/or a custom hidden layer class (which will replace the `Transformer` instantiation in the encoder)."
+ ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "-JBabpa2AOz8" + }, + "source": [ + "#### Without Customization\n", + "\n", + "Without any customization, `EncoderScaffold` behaves the same as the canonical `TransformerEncoder`.\n", + "\n", + "As shown in the following example, `EncoderScaffold` can load `TransformerEncoder`'s weights and output the same values:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "ktNzKuVByZQf" + }, + "outputs": [], + "source": [ + "default_hidden_cfg = dict(\n", + " num_attention_heads=cfg[\"num_attention_heads\"],\n", + " intermediate_size=cfg[\"intermediate_size\"],\n", + " intermediate_activation=activations.gelu,\n", + " dropout_rate=cfg[\"dropout_rate\"],\n", + " attention_dropout_rate=cfg[\"attention_dropout_rate\"],\n", + " kernel_initializer=tf.keras.initializers.TruncatedNormal(0.02),\n", + ")\n", + "default_embedding_cfg = dict(\n", + " vocab_size=cfg[\"vocab_size\"],\n", + " type_vocab_size=cfg[\"type_vocab_size\"],\n", + " hidden_size=cfg[\"hidden_size\"],\n", + " seq_length=cfg[\"sequence_length\"],\n", + " initializer=tf.keras.initializers.TruncatedNormal(0.02),\n", + " dropout_rate=cfg[\"dropout_rate\"],\n", + " max_seq_length=cfg[\"sequence_length\"],\n", + ")\n", + "default_kwargs = dict(\n", + " hidden_cfg=default_hidden_cfg,\n", + " embedding_cfg=default_embedding_cfg,\n", + " num_hidden_instances=cfg[\"num_layers\"],\n", + " pooled_output_dim=cfg[\"hidden_size\"],\n", + " return_all_layer_outputs=True,\n", + " pooler_layer_initializer=tf.keras.initializers.TruncatedNormal(0.02),\n", + ")\n", + "encoder_scaffold = modeling.networks.EncoderScaffold(**default_kwargs)\n", + "classifier_model_from_encoder_scaffold = build_classifier(encoder_scaffold)\n", + "classifier_model_from_encoder_scaffold.set_weights(\n", + " canonical_classifier_model.get_weights())\n", +
"predict(classifier_model_from_encoder_scaffold)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "sMaUmLyIuwcs" + }, + "source": [ + "#### Customize Embedding\n", + "\n", + "Next, we show how to use a customized embedding network.\n", + "\n", + "We firstly build an embedding network that will replace the default network. This one will have 2 inputs (`mask` and `word_ids`) instead of 3, and won't use positional embeddings." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "LTinnaG6vcsw" + }, + "outputs": [], + "source": [ + "word_ids = tf.keras.layers.Input(\n", + " shape=(cfg['sequence_length'],), dtype=tf.int32, name=\"input_word_ids\")\n", + "mask = tf.keras.layers.Input(\n", + " shape=(cfg['sequence_length'],), dtype=tf.int32, name=\"input_mask\")\n", + "embedding_layer = modeling.layers.OnDeviceEmbedding(\n", + " vocab_size=cfg['vocab_size'],\n", + " embedding_width=cfg['hidden_size'],\n", + " initializer=tf.keras.initializers.TruncatedNormal(stddev=0.02),\n", + " name=\"word_embeddings\")\n", + "word_embeddings = embedding_layer(word_ids)\n", + "attention_mask = layers.SelfAttentionMask()([word_embeddings, mask])\n", + "new_embedding_network = tf.keras.Model([word_ids, mask],\n", + " [word_embeddings, attention_mask])" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "HN7_yu-6O3qI" + }, + "source": [ + "Inspecting `new_embedding_network`, we can see it takes two inputs:\n", + "`input_word_ids` and `input_mask`." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "fO9zKFE4OpHp" + }, + "outputs": [], + "source": [ + "tf.keras.utils.plot_model(new_embedding_network, show_shapes=True, dpi=48)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "9cOaGQHLv12W" + }, + "source": [ + "We then can build a new encoder using the above `new_embedding_network`." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "mtFDMNf2vIl9" + }, + "outputs": [], + "source": [ + "kwargs = dict(default_kwargs)\n", + "\n", + "# Use new embedding network.\n", + "kwargs['embedding_cls'] = new_embedding_network\n", + "kwargs['embedding_data'] = embedding_layer.embeddings\n", + "\n", + "encoder_with_customized_embedding = modeling.networks.EncoderScaffold(**kwargs)\n", + "classifier_model = build_classifier(encoder_with_customized_embedding)\n", + "# ... 
Train the model ...\n", + "print(classifier_model.inputs)\n", + "\n", + "# Assert that there are only two inputs.\n", + "assert len(classifier_model.inputs) == 2" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "Z73ZQDtmwg9K" + }, + "source": [ + "#### Customize Transformer\n", + "\n", + "Users can also override the [hidden_cls](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/networks/encoder_scaffold.py#L103) argument in `EncoderScaffold`'s constructor to employ a customized Transformer layer.\n", + "\n", + "See [ReZeroTransformer](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/rezero_transformer.py) for how to implement a customized Transformer layer.\n", + "\n", + "The following is an example of using `ReZeroTransformer`:\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "uAIarLZgw6pA" + }, + "outputs": [], + "source": [ + "kwargs = dict(default_kwargs)\n", + "\n", + "# Use ReZeroTransformer.\n", + "kwargs['hidden_cls'] = modeling.layers.ReZeroTransformer\n", + "\n", + "encoder_with_rezero_transformer = modeling.networks.EncoderScaffold(**kwargs)\n", + "classifier_model = build_classifier(encoder_with_rezero_transformer)\n", + "# ... 
Train the model ...\n", + "predict(classifier_model)\n", + "\n", + "# Assert that the variable `rezero_alpha` from ReZeroTransformer exists.\n", + "assert 'rezero_alpha' in ''.join([x.name for x in classifier_model.trainable_weights])" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "6PMHFdvnxvR0" + }, + "source": [ + "### Use [TransformerScaffold](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/transformer_scaffold.py)\n", + "\n", + "The above method of customizing `Transformer` requires rewriting the whole `Transformer` layer, while sometimes you may only want to customize either the attention layer or the feedforward block. In this case, [TransformerScaffold](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/transformer_scaffold.py) can be used.\n", + "\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "D6FejlgwyAy_" + }, + "source": [ + "#### Customize Attention Layer\n", + "\n", + "Users can also override the [attention_cls](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/transformer_scaffold.py#L45) argument in `TransformerScaffold`'s constructor to employ a customized Attention layer.\n", + "\n", + "See [TalkingHeadsAttention](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/talking_heads_attention.py) for how to implement a customized `Attention` layer.\n", + "\n", + "The following is an example of using [TalkingHeadsAttention](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/talking_heads_attention.py):" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "nFrSMrZuyNeQ" + }, + "outputs": [], + "source": [ + "# Use TalkingHeadsAttention\n", + "hidden_cfg = dict(default_hidden_cfg)\n", + "hidden_cfg['attention_cls'] = modeling.layers.TalkingHeadsAttention\n", + "\n", + 
"kwargs = dict(default_kwargs)\n", + "kwargs['hidden_cls'] = modeling.layers.TransformerScaffold\n", + "kwargs['hidden_cfg'] = hidden_cfg\n", + "\n", + "encoder = modeling.networks.EncoderScaffold(**kwargs)\n", + "classifier_model = build_classifier(encoder)\n", + "# ... Train the model ...\n", + "predict(classifier_model)\n", + "\n", + "# Assert that the variable `pre_softmax_weight` from TalkingHeadsAttention exists.\n", + "assert 'pre_softmax_weight' in ''.join([x.name for x in classifier_model.trainable_weights])" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "kuEJcTyByVvI" + }, + "source": [ + "#### Customize Feedforward Layer\n", + "\n", + "Similarly, one could also customize the feedforward layer.\n", + "\n", + "See [GatedFeedforward](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/gated_feedforward.py) for how to implement a customized feedforward layer.\n", + "\n", + "The following is an example of using [GatedFeedforward](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/gated_feedforward.py):" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "XAbKy_l4y_-i" + }, + "outputs": [], + "source": [ + "# Use GatedFeedforward\n", + "hidden_cfg = dict(default_hidden_cfg)\n", + "hidden_cfg['feedforward_cls'] = modeling.layers.GatedFeedforward\n", + "\n", + "kwargs = dict(default_kwargs)\n", + "kwargs['hidden_cls'] = modeling.layers.TransformerScaffold\n", + "kwargs['hidden_cfg'] = hidden_cfg\n", + "\n", + "encoder_with_gated_feedforward = modeling.networks.EncoderScaffold(**kwargs)\n", + "classifier_model = build_classifier(encoder_with_gated_feedforward)\n", + "# ... 
Train the model ...\n", + "predict(classifier_model)\n", + "\n", + "# Assert that the variable `gate` from GatedFeedforward exists.\n", + "assert 'gate' in ''.join([x.name for x in classifier_model.trainable_weights])" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "a_8NWUhkzeAq" + }, + "source": [ + "### Build a new Encoder using building blocks from KerasBERT.\n", + "\n", + "Finally, you could also build a new encoder using building blocks in the modeling library.\n", + "\n", + "See [AlbertTransformerEncoder](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/networks/albert_transformer_encoder.py) as an example:\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "xsiA3RzUzmUM" + }, + "outputs": [], + "source": [ + "albert_encoder = modeling.networks.AlbertTransformerEncoder(**cfg)\n", + "classifier_model = build_classifier(albert_encoder)\n", + "# ... Train the model ...\n", + "predict(classifier_model)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "MeidDfhlHKSO" + }, + "source": [ + "Inspecting the `albert_encoder`, we see it stacks the same `Transformer` layer multiple times." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "Uv_juT22HERW" + }, + "outputs": [], + "source": [ + "tf.keras.utils.plot_model(albert_encoder, show_shapes=True, dpi=48)" + ] + } + ], + "metadata": { + "colab": { + "collapsed_sections": [], + "name": "Customizing a Transformer Encoder", + "private_outputs": true, + "provenance": [], + "toc_visible": true + }, + "kernelspec": { + "display_name": "Python 3", + "name": "python3" + } + }, + "nbformat": 4, + "nbformat_minor": 0 +} diff --git a/official/colab/nlp/nlp_modeling_library_intro.ipynb b/official/colab/nlp/nlp_modeling_library_intro.ipynb new file mode 100644 index 0000000000000000000000000000000000000000..f5ffcef96419aef9c25daaf8c585efe9a3043f73 --- /dev/null +++ b/official/colab/nlp/nlp_modeling_library_intro.ipynb @@ -0,0 +1,601 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "80xnUmoI7fBX" + }, + "source": [ + "##### Copyright 2020 The TensorFlow Authors." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "cellView": "form", + "colab": {}, + "colab_type": "code", + "id": "8nvTnfs6Q692" + }, + "outputs": [], + "source": [ + "#@title Licensed under the Apache License, Version 2.0 (the \"License\");\n", + "# you may not use this file except in compliance with the License.\n", + "# You may obtain a copy of the License at\n", + "#\n", + "# https://www.apache.org/licenses/LICENSE-2.0\n", + "#\n", + "# Unless required by applicable law or agreed to in writing, software\n", + "# distributed under the License is distributed on an \"AS IS\" BASIS,\n", + "# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\n", + "# See the License for the specific language governing permissions and\n", + "# limitations under the License." 
+ ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "WmfcMK5P5C1G" + }, + "source": [ + "# Introduction to the TensorFlow Models NLP library" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "cH-oJ8R6AHMK" + }, + "source": [ + "\u003ctable class=\"tfo-notebook-buttons\" align=\"left\"\u003e\n", + " \u003ctd\u003e\n", + " \u003ca target=\"_blank\" href=\"https://www.tensorflow.org/official_models/nlp/nlp_modeling_library_intro\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/tf_logo_32px.png\" /\u003eView on TensorFlow.org\u003c/a\u003e\n", + " \u003c/td\u003e\n", + " \u003ctd\u003e\n", + " \u003ca target=\"_blank\" href=\"https://colab.research.google.com/github/tensorflow/models/blob/master/official/colab/nlp/nlp_modeling_library_intro.ipynb\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/colab_logo_32px.png\" /\u003eRun in Google Colab\u003c/a\u003e\n", + " \u003c/td\u003e\n", + " \u003ctd\u003e\n", + " \u003ca target=\"_blank\" href=\"https://github.com/tensorflow/models/blob/master/official/colab/nlp/nlp_modeling_library_intro.ipynb\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/GitHub-Mark-32px.png\" /\u003eView source on GitHub\u003c/a\u003e\n", + " \u003c/td\u003e\n", + " \u003ctd\u003e\n", + " \u003ca href=\"https://storage.googleapis.com/tensorflow_docs/models/official/colab/nlp/nlp_modeling_library_intro.ipynb\"\u003e\u003cimg src=\"https://www.tensorflow.org/images/download_logo_32px.png\" /\u003eDownload notebook\u003c/a\u003e\n", + " \u003c/td\u003e\n", + "\u003c/table\u003e" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "0H_EFIhq4-MJ" + }, + "source": [ + "## Learning objectives\n", + "\n", + "In this Colab notebook, you will learn how to build transformer-based models for common NLP tasks including pretraining, span labelling and classification using the building blocks from [NLP modeling 
library](https://github.com/tensorflow/models/tree/master/official/nlp/modeling)." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "2N97-dps_nUk" + }, + "source": [ + "## Install and import" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "459ygAVl_rg0" + }, + "source": [ + "### Install the TensorFlow Model Garden pip package\n", + "\n", + "* `tf-models-nightly` is the nightly Model Garden package, created automatically every day.\n", + "* `pip` will install all models and dependencies automatically." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "Y-qGkdh6_sZc" + }, + "outputs": [], + "source": [ + "!pip install -q tf-nightly\n", + "!pip install -q tf-models-nightly" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "e4huSSwyAG_5" + }, + "source": [ + "### Import TensorFlow and other libraries" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "jqYXqtjBAJd9" + }, + "outputs": [], + "source": [ + "import numpy as np\n", + "import tensorflow as tf\n", + "\n", + "from official.nlp import modeling\n", + "from official.nlp.modeling import layers, losses, models, networks" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "djBQWjvy-60Y" + }, + "source": [ + "## BERT pretraining model\n", + "\n", + "BERT ([Pre-training of Deep Bidirectional Transformers for Language Understanding](https://arxiv.org/abs/1810.04805)) introduced the method of pre-training language representations on a large text corpus and then using that model for downstream NLP tasks.\n", + "\n", + "In this section, we will learn how to build a model to pretrain BERT on the masked language modeling task and the next sentence prediction task. 
For simplicity, we only show a minimal example and use dummy data." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "MKuHVlsCHmiq" + }, + "source": [ + "### Build a `BertPretrainer` model wrapping `TransformerEncoder`\n", + "\n", + "The [TransformerEncoder](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/networks/transformer_encoder.py) implements the Transformer-based encoder as described in the [BERT paper](https://arxiv.org/abs/1810.04805). It includes the embedding lookups and transformer layers, but not the masked language model or classification task networks.\n", + "\n", + "The [BertPretrainer](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/models/bert_pretrainer.py) allows a user to pass in a transformer stack, and instantiates the masked language model and classification networks that are used to create the training objectives." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "EXkcXz-9BwB3" + }, + "outputs": [], + "source": [ + "# Build a small transformer network.\n", + "vocab_size = 100\n", + "sequence_length = 16\n", + "network = modeling.networks.TransformerEncoder(\n", + " vocab_size=vocab_size, num_layers=2, sequence_length=sequence_length)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "0NH5irV5KTMS" + }, + "source": [ + "Inspecting the encoder, we see that it contains a few embedding layers and stacked `Transformer` layers, which are connected to three input layers:\n", + "\n", + "`input_word_ids`, `input_type_ids` and `input_mask`.\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "lZNoZkBrIoff" + }, + "outputs": [], + "source": [ + "tf.keras.utils.plot_model(network, show_shapes=True, dpi=48)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + 
"colab_type": "code", + "id": "o7eFOZXiIl-b" + }, + "outputs": [], + "source": [ + "# Create a BERT pretrainer with the created network.\n", + "num_token_predictions = 8\n", + "bert_pretrainer = modeling.models.BertPretrainer(\n", + " network, num_classes=2, num_token_predictions=num_token_predictions, output='predictions')" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "d5h5HT7gNHx_" + }, + "source": [ + "Inspecting the `bert_pretrainer`, we see it wraps the `encoder` with additional `MaskedLM` and `Classification` heads." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "2tcNfm03IBF7" + }, + "outputs": [], + "source": [ + "tf.keras.utils.plot_model(bert_pretrainer, show_shapes=True, dpi=48)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "F2oHrXGUIS0M" + }, + "outputs": [], + "source": [ + "# We can feed some dummy data to get masked language model and sentence output.\n", + "batch_size = 2\n", + "word_id_data = np.random.randint(vocab_size, size=(batch_size, sequence_length))\n", + "mask_data = np.random.randint(2, size=(batch_size, sequence_length))\n", + "type_id_data = np.random.randint(2, size=(batch_size, sequence_length))\n", + "masked_lm_positions_data = np.random.randint(2, size=(batch_size, num_token_predictions))\n", + "\n", + "outputs = bert_pretrainer(\n", + " [word_id_data, mask_data, type_id_data, masked_lm_positions_data])\n", + "lm_output = outputs[\"masked_lm\"]\n", + "sentence_output = outputs[\"classification\"]\n", + "print(lm_output)\n", + "print(sentence_output)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "bnx3UCHniCS5" + }, + "source": [ + "### Compute loss\n", + "Next, we can use `lm_output` and `sentence_output` to compute `loss`." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "k30H4Q86f52x" + }, + "outputs": [], + "source": [ + "masked_lm_ids_data = np.random.randint(vocab_size, size=(batch_size, num_token_predictions))\n", + "masked_lm_weights_data = np.random.randint(2, size=(batch_size, num_token_predictions))\n", + "next_sentence_labels_data = np.random.randint(2, size=(batch_size))\n", + "\n", + "mlm_loss = modeling.losses.weighted_sparse_categorical_crossentropy_loss(\n", + " labels=masked_lm_ids_data,\n", + " predictions=lm_output,\n", + " weights=masked_lm_weights_data)\n", + "sentence_loss = modeling.losses.weighted_sparse_categorical_crossentropy_loss(\n", + " labels=next_sentence_labels_data,\n", + " predictions=sentence_output)\n", + "loss = mlm_loss + sentence_loss\n", + "print(loss)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "wrmSs8GjHxVw" + }, + "source": [ + "With the loss, you can optimize the model.\n", + "After training, we can save the weights of the TransformerEncoder for downstream fine-tuning tasks. Please see [run_pretraining.py](https://github.com/tensorflow/models/blob/master/official/nlp/bert/run_pretraining.py) for the full example.\n", + "\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "k8cQVFvBCV4s" + }, + "source": [ + "## Span labeling model\n", + "\n", + "Span labeling is the task of assigning labels to a span of text, for example, labeling a span of text as the answer to a given question.\n", + "\n", + "In this section, we will learn how to build a span labeling model. Again, we use dummy data for simplicity." 
+ ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "xrLLEWpfknUW" + }, + "source": [ + "### Build a BertSpanLabeler wrapping TransformerEncoder\n", + "\n", + "[BertSpanLabeler](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/models/bert_span_labeler.py) implements a simple single-span start-end predictor (that is, a model that predicts two values: a start token index and an end token index), suitable for SQuAD-style tasks.\n", + "\n", + "Note that `BertSpanLabeler` wraps a `TransformerEncoder`, the weights of which can be restored from the above pretraining model.\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "B941M4iUCejO" + }, + "outputs": [], + "source": [ + "network = modeling.networks.TransformerEncoder(\n", + " vocab_size=vocab_size, num_layers=2, sequence_length=sequence_length)\n", + "\n", + "# Create a BERT trainer with the created network.\n", + "bert_span_labeler = modeling.models.BertSpanLabeler(network)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "QpB9pgj4PpMg" + }, + "source": [ + "Inspecting the `bert_span_labeler`, we see it wraps the encoder with an additional `SpanLabeling` head that outputs `start_position` and `end_position`." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "RbqRNJCLJu4H" + }, + "outputs": [], + "source": [ + "tf.keras.utils.plot_model(bert_span_labeler, show_shapes=True, dpi=48)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "fUf1vRxZJwio" + }, + "outputs": [], + "source": [ + "# Create a set of 2-dimensional data tensors to feed into the model.\n", + "word_id_data = np.random.randint(vocab_size, size=(batch_size, sequence_length))\n", + "mask_data = np.random.randint(2, size=(batch_size, sequence_length))\n", + "type_id_data = np.random.randint(2, size=(batch_size, sequence_length))\n", + "\n", + "# Feed the data to the model.\n", + "start_logits, end_logits = bert_span_labeler([word_id_data, mask_data, type_id_data])\n", + "print(start_logits)\n", + "print(end_logits)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "WqhgQaN1lt-G" + }, + "source": [ + "### Compute loss\n", + "With `start_logits` and `end_logits`, we can compute loss:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "waqs6azNl3Nn" + }, + "outputs": [], + "source": [ + "start_positions = np.random.randint(sequence_length, size=(batch_size))\n", + "end_positions = np.random.randint(sequence_length, size=(batch_size))\n", + "\n", + "start_loss = tf.keras.losses.sparse_categorical_crossentropy(\n", + " start_positions, start_logits, from_logits=True)\n", + "end_loss = tf.keras.losses.sparse_categorical_crossentropy(\n", + " end_positions, end_logits, from_logits=True)\n", + "\n", + "total_loss = (tf.reduce_mean(start_loss) + tf.reduce_mean(end_loss)) / 2\n", + "print(total_loss)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "Zdf03YtZmd_d" + }, + "source": [ + "With the `loss`, you can optimize 
the model. Please see [run_squad.py](https://github.com/tensorflow/models/blob/master/official/nlp/bert/run_squad.py) for the full example." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "0A1XnGSTChg9" + }, + "source": [ + "## Classification model\n", + "\n", + "In this last section, we show how to build a text classification model.\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "MSK8OpZgnQa9" + }, + "source": [ + "### Build a BertClassifier model wrapping TransformerEncoder\n", + "\n", + "[BertClassifier](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/models/bert_classifier.py) implements a [CLS] token classification model containing a single classification head." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "cXXCsffkCphk" + }, + "outputs": [], + "source": [ + "network = modeling.networks.TransformerEncoder(\n", + " vocab_size=vocab_size, num_layers=2, sequence_length=sequence_length)\n", + "\n", + "# Create a BERT trainer with the created network.\n", + "num_classes = 2\n", + "bert_classifier = modeling.models.BertClassifier(\n", + " network, num_classes=num_classes)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "8tZKueKYP4bB" + }, + "source": [ + "Inspecting the `bert_classifier`, we see it wraps the `encoder` with an additional `Classification` head." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "snlutm9ZJgEZ" + }, + "outputs": [], + "source": [ + "tf.keras.utils.plot_model(bert_classifier, show_shapes=True, dpi=48)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "yyHPHsqBJkCz" + }, + "outputs": [], + "source": [ + "# Create a set of 2-dimensional data tensors to feed into the model.\n", + "word_id_data = np.random.randint(vocab_size, size=(batch_size, sequence_length))\n", + "mask_data = np.random.randint(2, size=(batch_size, sequence_length))\n", + "type_id_data = np.random.randint(2, size=(batch_size, sequence_length))\n", + "\n", + "# Feed the data to the model.\n", + "logits = bert_classifier([word_id_data, mask_data, type_id_data])\n", + "print(logits)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "w--a2mg4nzKm" + }, + "source": [ + "### Compute loss\n", + "\n", + "With `logits`, we can compute `loss`:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": {}, + "colab_type": "code", + "id": "9X0S1DoFn_5Q" + }, + "outputs": [], + "source": [ + "labels = np.random.randint(num_classes, size=(batch_size))\n", + "\n", + "loss = modeling.losses.weighted_sparse_categorical_crossentropy_loss(\n", + " labels=labels, predictions=tf.nn.log_softmax(logits, axis=-1))\n", + "print(loss)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "colab_type": "text", + "id": "mzBqOylZo3og" + }, + "source": [ + "With the `loss`, you can optimize the model. Please see [run_classifier.py](https://github.com/tensorflow/models/blob/master/official/nlp/bert/run_classifier.py) or the colab [fine_tuning_bert.ipynb](https://github.com/tensorflow/models/blob/master/official/colab/fine_tuning_bert.ipynb) for the full example." 
+ ] + } + ], + "metadata": { + "colab": { + "collapsed_sections": [], + "name": "Introduction to the TensorFlow Models NLP library", + "private_outputs": true, + "provenance": [], + "toc_visible": true + }, + "kernelspec": { + "display_name": "Python 3", + "name": "python3" + } + }, + "nbformat": 4, + "nbformat_minor": 0 +} diff --git a/official/core/base_task.py b/official/core/base_task.py index f5dfdd4f5c2ff9b75b3571df31016196e92cd934..76ebd8e14dea783bd5e495bafeb2e3218ae26eb6 100644 --- a/official/core/base_task.py +++ b/official/core/base_task.py @@ -18,11 +18,11 @@ import abc import functools from typing import Any, Callable, Optional +from absl import logging import six import tensorflow as tf from official.modeling.hyperparams import config_definitions as cfg -from official.utils import registry @six.add_metaclass(abc.ABCMeta) @@ -37,17 +37,29 @@ class Task(tf.Module): # Special keys in train/validate step returned logs. loss = "loss" - def __init__(self, params: cfg.TaskConfig): + def __init__(self, params: cfg.TaskConfig, logging_dir: str = None): + """Task initialization. + + Args: + params: cfg.TaskConfig instance. + logging_dir: a string pointing to where the model, summaries etc. will be + saved. You can also write additional stuff in this directory. + """ self._task_config = params + self._logging_dir = logging_dir @property def task_config(self) -> cfg.TaskConfig: return self._task_config + @property + def logging_dir(self) -> str: + return self._logging_dir + def initialize(self, model: tf.keras.Model): """A callback function used as CheckpointManager's init_fn. - This function will be called when no checkpoint found for the model. + This function will be called when no checkpoint is found for the model. If there is a checkpoint, the checkpoint will be loaded and this function will not be called. You can use this callback function to load a pretrained checkpoint, saved under a directory other than the model_dir. 
@@ -55,11 +67,23 @@ class Task(tf.Module): Args: model: The keras.Model built or used by this task. """ - pass + ckpt_dir_or_file = self.task_config.init_checkpoint + logging.info("Trying to load pretrained checkpoint from %s", + ckpt_dir_or_file) + if tf.io.gfile.isdir(ckpt_dir_or_file): + ckpt_dir_or_file = tf.train.latest_checkpoint(ckpt_dir_or_file) + if not ckpt_dir_or_file: + return + + ckpt = tf.train.Checkpoint(**model.checkpoint_items) + status = ckpt.restore(ckpt_dir_or_file) + status.expect_partial().assert_existing_objects_matched() + logging.info("Finished loading pretrained checkpoint from %s", + ckpt_dir_or_file) @abc.abstractmethod def build_model(self) -> tf.keras.Model: - """Creates the model architecture. + """Creates model architecture. Returns: A model instance. @@ -107,6 +131,7 @@ class Task(tf.Module): """Returns a dataset or a nested structure of dataset functions. Dataset functions define per-host datasets with the per-replica batch size. + With distributed training, this method runs on remote hosts. Args: params: hyperparams to create input pipelines. @@ -122,7 +147,7 @@ class Task(tf.Module): Args: labels: optional label tensors. model_outputs: a nested structure of output tensors. - aux_losses: auxiliarly loss tensors, i.e. `losses` in keras.Model. + aux_losses: auxiliary loss tensors, i.e. `losses` in keras.Model. Returns: The total loss tensor. @@ -172,6 +197,8 @@ class Task(tf.Module): metrics=None): """Does forward and backward. + With distribution strategies, this method runs on devices. + Args: inputs: a dictionary of input tensors. model: the model, forward pass definition. @@ -217,7 +244,9 @@ class Task(tf.Module): return logs def validation_step(self, inputs, model: tf.keras.Model, metrics=None): - """Validatation step. + """Validation step. + + With distribution strategies, this method runs on devices. Args: inputs: a dictionary of input tensors. 
@@ -244,52 +273,24 @@ class Task(tf.Module): return logs def inference_step(self, inputs, model: tf.keras.Model): - """Performs the forward step.""" - return model(inputs, training=False) - - -_REGISTERED_TASK_CLS = {} - + """Performs the forward step. -# TODO(b/158268740): Move these outside the base class file. -# TODO(b/158741360): Add type annotations once pytype checks across modules. -def register_task_cls(task_config_cls): - """Decorates a factory of Tasks for lookup by a subclass of TaskConfig. + With distribution strategies, this method runs on devices. - This decorator supports registration of tasks as follows: + Args: + inputs: a dictionary of input tensors. + model: the keras.Model. - ``` - @dataclasses.dataclass - class MyTaskConfig(TaskConfig): - # Add fields here. - pass + Returns: + Model outputs. + """ + return model(inputs, training=False) - @register_task_cls(MyTaskConfig) - class MyTask(Task): - # Inherits def __init__(self, task_config). + def aggregate_logs(self, state, step_logs): + """Optional aggregation over logs returned from a validation step.""" pass - my_task_config = MyTaskConfig() - my_task = get_task(my_task_config) # Returns MyTask(my_task_config). - ``` - - Besisdes a class itself, other callables that create a Task from a TaskConfig - can be decorated by the result of this function, as long as there is at most - one registration for each config class. - - Args: - task_config_cls: a subclass of TaskConfig (*not* an instance of TaskConfig). - Each task_config_cls can only be used for a single registration. - - Returns: - A callable for use as class decorator that registers the decorated class - for creation from an instance of task_config_cls. - """ - return registry.register(_REGISTERED_TASK_CLS, task_config_cls) - + def reduce_aggregated_logs(self, aggregated_logs): + """Optional reduce of aggregated logs over validation steps.""" + return {} -# The user-visible get_task() is defined after classes have been registered. 
-# TODO(b/158741360): Add type annotations once pytype checks across modules. -def get_task_cls(task_config_cls): - task_cls = registry.lookup(_REGISTERED_TASK_CLS, task_config_cls) - return task_cls diff --git a/official/core/exp_factory.py b/official/core/exp_factory.py new file mode 100644 index 0000000000000000000000000000000000000000..8270565b7d97bfd820de26bbbda6d3f1d96e33d2 --- /dev/null +++ b/official/core/exp_factory.py @@ -0,0 +1,37 @@ +# Lint as: python3 +# Copyright 2020 The TensorFlow Authors. All Rights Reserved. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. 
+# ============================================================================== +"""Experiment factory methods.""" + +from official.modeling.hyperparams import config_definitions as cfg +from official.utils import registry + + +_REGISTERED_CONFIGS = {} + + +def register_config_factory(name): + """Register ExperimentConfig factory method.""" + return registry.register(_REGISTERED_CONFIGS, name) + + +def get_exp_config_creater(exp_name: str): + """Looks up ExperimentConfig factory methods.""" + exp_creater = registry.lookup(_REGISTERED_CONFIGS, exp_name) + return exp_creater + + +def get_exp_config(exp_name: str) -> cfg.ExperimentConfig: + return get_exp_config_creater(exp_name)() diff --git a/official/core/input_reader.py b/official/core/input_reader.py index 52f6e84e4bd02d4178586556ca191912de18fc18..20589ad9cee33546922cd5c9deaba67b2a0509ad 100644 --- a/official/core/input_reader.py +++ b/official/core/input_reader.py @@ -32,8 +32,9 @@ class InputReader: dataset_fn=tf.data.TFRecordDataset, decoder_fn: Optional[Callable[..., Any]] = None, parser_fn: Optional[Callable[..., Any]] = None, - dataset_transform_fn: Optional[Callable[[tf.data.Dataset], - tf.data.Dataset]] = None, + transform_and_batch_fn: Optional[Callable[ + [tf.data.Dataset, Optional[tf.distribute.InputContext]], + tf.data.Dataset]] = None, postprocess_fn: Optional[Callable[..., Any]] = None): """Initializes an InputReader instance. @@ -48,9 +49,12 @@ class InputReader: parser_fn: An optional `callable` that takes the decoded raw tensors dict and parse them into a dictionary of tensors that can be consumed by the model. It will be executed after decoder_fn. - dataset_transform_fn: An optional `callable` that takes a - `tf.data.Dataset` object and returns a `tf.data.Dataset`. It will be - executed after parser_fn. + transform_and_batch_fn: An optional `callable` that takes a + `tf.data.Dataset` object and an optional `tf.distribute.InputContext` as + input, and returns a `tf.data.Dataset` object. 
It will be + executed after `parser_fn` to transform and batch the dataset; if None, + after `parser_fn` is executed, the dataset will be batched into + per-replica batch size. postprocess_fn: A optional `callable` that processes batched tensors. It will be executed after batching. """ @@ -101,7 +105,7 @@ class InputReader: self._dataset_fn = dataset_fn self._decoder_fn = decoder_fn self._parser_fn = parser_fn - self._dataset_transform_fn = dataset_transform_fn + self._transform_and_batch_fn = transform_and_batch_fn self._postprocess_fn = postprocess_fn def _read_sharded_files( @@ -171,6 +175,9 @@ class InputReader: as_supervised=self._tfds_as_supervised, decoders=decoders, read_config=read_config) + + if self._is_training: + dataset = dataset.repeat() return dataset @property @@ -211,13 +218,13 @@ class InputReader: dataset = maybe_map_fn(dataset, self._decoder_fn) dataset = maybe_map_fn(dataset, self._parser_fn) - if self._dataset_transform_fn is not None: - dataset = self._dataset_transform_fn(dataset) - - per_replica_batch_size = input_context.get_per_replica_batch_size( - self._global_batch_size) if input_context else self._global_batch_size + if self._transform_and_batch_fn is not None: + dataset = self._transform_and_batch_fn(dataset, input_context) + else: + per_replica_batch_size = input_context.get_per_replica_batch_size( + self._global_batch_size) if input_context else self._global_batch_size + dataset = dataset.batch( + per_replica_batch_size, drop_remainder=self._drop_remainder) - dataset = dataset.batch( - per_replica_batch_size, drop_remainder=self._drop_remainder) dataset = maybe_map_fn(dataset, self._postprocess_fn) return dataset.prefetch(tf.data.experimental.AUTOTUNE) diff --git a/official/core/task_factory.py b/official/core/task_factory.py new file mode 100644 index 0000000000000000000000000000000000000000..394031ae99405bf9b69d6236d41c423fcb886697 --- /dev/null +++ b/official/core/task_factory.py @@ -0,0 +1,68 @@ +# Lint as: python3 +# 
Copyright 2020 The TensorFlow Authors. All Rights Reserved. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +# ============================================================================== +"""A global factory to register and access all registered tasks.""" + +from official.utils import registry + +_REGISTERED_TASK_CLS = {} + + +# TODO(b/158741360): Add type annotations once pytype checks across modules. +def register_task_cls(task_config_cls): + """Decorates a factory of Tasks for lookup by a subclass of TaskConfig. + + This decorator supports registration of tasks as follows: + + ``` + @dataclasses.dataclass + class MyTaskConfig(TaskConfig): + # Add fields here. + pass + + @register_task_cls(MyTaskConfig) + class MyTask(Task): + # Inherits def __init__(self, task_config). + pass + + my_task_config = MyTaskConfig() + my_task = get_task(my_task_config) # Returns MyTask(my_task_config). + ``` + + Besides a class itself, other callables that create a Task from a TaskConfig + can be decorated by the result of this function, as long as there is at most + one registration for each config class. + + Args: + task_config_cls: a subclass of TaskConfig (*not* an instance of TaskConfig). + Each task_config_cls can only be used for a single registration. + + Returns: + A callable for use as class decorator that registers the decorated class + for creation from an instance of task_config_cls. 
+ """ + return registry.register(_REGISTERED_TASK_CLS, task_config_cls) + + +def get_task(task_config, **kwargs): + """Creates a Task (of suitable subclass type) from task_config.""" + return get_task_cls(task_config.__class__)(task_config, **kwargs) + + +# The user-visible get_task() is defined after classes have been registered. +# TODO(b/158741360): Add type annotations once pytype checks across modules. +def get_task_cls(task_config_cls): + task_cls = registry.lookup(_REGISTERED_TASK_CLS, task_config_cls) + return task_cls diff --git a/official/modeling/activations/gelu.py b/official/modeling/activations/gelu.py index c045bffa95b29e069831b548701b76d1b8e76c0d..dc4de8204ae81e9ad8c17f12ed0973fb0eff3c86 100644 --- a/official/modeling/activations/gelu.py +++ b/official/modeling/activations/gelu.py @@ -14,12 +14,6 @@ # ============================================================================== """Gaussian error linear unit.""" -from __future__ import absolute_import -from __future__ import division -from __future__ import print_function - -import math - import tensorflow as tf @@ -35,6 +29,4 @@ def gelu(x): Returns: `x` with the GELU activation applied. """ - cdf = 0.5 * (1.0 + tf.tanh( - (math.sqrt(2 / math.pi) * (x + 0.044715 * tf.pow(x, 3))))) - return x * cdf + return tf.keras.activations.gelu(x, approximate=True) diff --git a/official/modeling/hyperparams/base_config.py b/official/modeling/hyperparams/base_config.py index 7ce5ce2d55016dce0c985a0e6f9fe3893a25f644..b03f069c8bdae2317bd57ac9b2cc4c91ce9d169b 100644 --- a/official/modeling/hyperparams/base_config.py +++ b/official/modeling/hyperparams/base_config.py @@ -126,10 +126,10 @@ class Config(params_dict.ParamsDict): subconfig_type = Config if k in cls.__annotations__: # Directly Config subtype. 
- type_annotation = cls.__annotations__[k] + type_annotation = cls.__annotations__[k] # pytype: disable=invalid-annotation if (isinstance(type_annotation, type) and issubclass(type_annotation, Config)): - subconfig_type = cls.__annotations__[k] + subconfig_type = cls.__annotations__[k] # pytype: disable=invalid-annotation else: # Check if the field is a sequence of subtypes. field_type = getattr(type_annotation, '__origin__', type(None)) diff --git a/official/modeling/hyperparams/config_definitions.py b/official/modeling/hyperparams/config_definitions.py index 2fbcdea4455aa0f11728a3b077c4d981df8682cd..c58b1de7fa5c728d549396ef8aaead0376e96963 100644 --- a/official/modeling/hyperparams/config_definitions.py +++ b/official/modeling/hyperparams/config_definitions.py @@ -14,13 +14,13 @@ # limitations under the License. # ============================================================================== """Common configuration settings.""" + from typing import Optional, Union import dataclasses from official.modeling.hyperparams import base_config from official.modeling.optimization.configs import optimization_config -from official.utils import registry OptimizationConfig = optimization_config.OptimizationConfig @@ -111,6 +111,8 @@ class RuntimeConfig(base_config.Config): run_eagerly: Whether or not to run the experiment eagerly. batchnorm_spatial_persistent: Whether or not to enable the spatial persistent mode for CuDNN batch norm kernel for improved GPU performance. + allow_tpu_summary: Whether to allow summaries to happen inside the XLA program + that runs on TPU through automatic outside compilation. 
""" distribution_strategy: str = "mirrored" enable_xla: bool = False @@ -123,8 +125,8 @@ class RuntimeConfig(base_config.Config): task_index: int = -1 all_reduce_alg: Optional[str] = None num_packs: int = 1 - loss_scale: Optional[Union[str, float]] = None mixed_precision_dtype: Optional[str] = None + loss_scale: Optional[Union[str, float]] = None run_eagerly: bool = False batchnorm_spatial_persistent: bool = False @@ -172,25 +174,39 @@ class TrainerConfig(base_config.Config): eval_tf_function: whether or not to use tf_function for eval. steps_per_loop: number of steps per loop. summary_interval: number of steps between each summary. - checkpoint_intervals: number of steps between checkpoints. + checkpoint_interval: number of steps between checkpoints. max_to_keep: max checkpoints to keep. continuous_eval_timeout: maximum number of seconds to wait between - checkpoints, if set to None, continuous eval will wait indefinetely. + checkpoints, if set to None, continuous eval will wait indefinitely. + This is only used continuous_train_and_eval and continuous_eval modes. + train_steps: number of train steps. + validation_steps: number of eval steps. If `None`, the entire eval dataset + is used. + validation_interval: number of training steps to run between evaluations. """ optimizer_config: OptimizationConfig = OptimizationConfig() + # Orbit settings. train_tf_while_loop: bool = True train_tf_function: bool = True eval_tf_function: bool = True + allow_tpu_summary: bool = False + # Trainer intervals. steps_per_loop: int = 1000 summary_interval: int = 1000 checkpoint_interval: int = 1000 + # Checkpoint manager. max_to_keep: int = 5 continuous_eval_timeout: Optional[int] = None + # Train/Eval routines. 
+ train_steps: int = 0 + validation_steps: Optional[int] = None + validation_interval: int = 1000 @dataclasses.dataclass class TaskConfig(base_config.Config): - network: base_config.Config = None + init_checkpoint: str = "" + model: base_config.Config = None train_data: DataConfig = DataConfig() validation_data: DataConfig = DataConfig() @@ -198,24 +214,7 @@ class TaskConfig(base_config.Config): @dataclasses.dataclass class ExperimentConfig(base_config.Config): """Top-level configuration.""" - mode: str = "train" # train, eval, train_and_eval. task: TaskConfig = TaskConfig() trainer: TrainerConfig = TrainerConfig() runtime: RuntimeConfig = RuntimeConfig() - train_steps: int = 0 - validation_steps: Optional[int] = None - validation_interval: int = 100 - - -_REGISTERED_CONFIGS = {} - - -def register_config_factory(name): - """Register ExperimentConfig factory method.""" - return registry.register(_REGISTERED_CONFIGS, name) - -def get_exp_config_creater(exp_name: str): - """Looks up ExperimentConfig factory methods.""" - exp_creater = registry.lookup(_REGISTERED_CONFIGS, exp_name) - return exp_creater diff --git a/official/modeling/optimization/configs/learning_rate_config.py b/official/modeling/optimization/configs/learning_rate_config.py index b55c713f1905cf9aaa52f87a6663d3385628d5a5..2a0625e0a75040e115e91c6be5b89bddb0de06b0 100644 --- a/official/modeling/optimization/configs/learning_rate_config.py +++ b/official/modeling/optimization/configs/learning_rate_config.py @@ -20,6 +20,20 @@ import dataclasses from official.modeling.hyperparams import base_config +@dataclasses.dataclass +class ConstantLrConfig(base_config.Config): + """Configuration for constant learning rate. + + This class is a container for the constant learning rate decay configs. + + Attributes: + name: The name of the learning rate schedule. Defaults to Constant. + learning_rate: A float. The learning rate. Defaults to 0.1. 
+ """ + name: str = 'Constant' + learning_rate: float = 0.1 + + @dataclasses.dataclass class StepwiseLrConfig(base_config.Config): """Configuration for stepwise learning rate decay. diff --git a/official/modeling/optimization/configs/optimization_config.py b/official/modeling/optimization/configs/optimization_config.py index 8aba9943ae3bf3f4a9d0c1df4d715d63ef0a26a8..23e112e1b6197a8505a18b9b8d573012d1dd5e73 100644 --- a/official/modeling/optimization/configs/optimization_config.py +++ b/official/modeling/optimization/configs/optimization_config.py @@ -39,12 +39,14 @@ class OptimizerConfig(oneof.OneOfConfig): adam: adam optimizer config. adamw: adam with weight decay. lamb: lamb optimizer. + rmsprop: rmsprop optimizer. """ type: Optional[str] = None sgd: opt_cfg.SGDConfig = opt_cfg.SGDConfig() adam: opt_cfg.AdamConfig = opt_cfg.AdamConfig() adamw: opt_cfg.AdamWeightDecayConfig = opt_cfg.AdamWeightDecayConfig() lamb: opt_cfg.LAMBConfig = opt_cfg.LAMBConfig() + rmsprop: opt_cfg.RMSPropConfig = opt_cfg.RMSPropConfig() @dataclasses.dataclass @@ -53,12 +55,14 @@ class LrConfig(oneof.OneOfConfig): Attributes: type: 'str', type of lr schedule to be used, on the of fields below. + constant: constant learning rate config. stepwise: stepwise learning rate config. exponential: exponential learning rate config. polynomial: polynomial learning rate config. cosine: cosine learning rate config. 
""" type: Optional[str] = None + constant: lr_cfg.ConstantLrConfig = lr_cfg.ConstantLrConfig() stepwise: lr_cfg.StepwiseLrConfig = lr_cfg.StepwiseLrConfig() exponential: lr_cfg.ExponentialLrConfig = lr_cfg.ExponentialLrConfig() polynomial: lr_cfg.PolynomialLrConfig = lr_cfg.PolynomialLrConfig() diff --git a/official/modeling/optimization/configs/optimizer_config.py b/official/modeling/optimization/configs/optimizer_config.py index 4cafa9659119386d2583d8b52cb2ddf9afe37131..5e7ca2d0c195883b0af7a5920bc13402bada4139 100644 --- a/official/modeling/optimization/configs/optimizer_config.py +++ b/official/modeling/optimization/configs/optimizer_config.py @@ -28,18 +28,37 @@ class SGDConfig(base_config.Config): Attributes: name: name of the optimizer. - learning_rate: learning_rate for SGD optimizer. decay: decay rate for SGD optimizer. nesterov: nesterov for SGD optimizer. momentum: momentum for SGD optimizer. """ name: str = "SGD" - learning_rate: float = 0.01 decay: float = 0.0 nesterov: bool = False momentum: float = 0.0 +@dataclasses.dataclass +class RMSPropConfig(base_config.Config): + """Configuration for RMSProp optimizer. + + The attributes for this class matches the arguments of + tf.keras.optimizers.RMSprop. + + Attributes: + name: name of the optimizer. + rho: discounting factor for RMSprop optimizer. + momentum: momentum for RMSprop optimizer. + epsilon: epsilon value for RMSprop optimizer, help with numerical stability. + centered: Whether to normalize gradients or not. + """ + name: str = "RMSprop" + rho: float = 0.9 + momentum: float = 0.0 + epsilon: float = 1e-7 + centered: bool = False + + @dataclasses.dataclass class AdamConfig(base_config.Config): """Configuration for Adam optimizer. @@ -49,7 +68,6 @@ class AdamConfig(base_config.Config): Attributes: name: name of the optimizer. - learning_rate: learning_rate for Adam optimizer. beta_1: decay rate for 1st order moments. beta_2: decay rate for 2st order moments. 
epsilon: epsilon value used for numerical stability in Adam optimizer. @@ -57,7 +75,6 @@ class AdamConfig(base_config.Config): the paper "On the Convergence of Adam and beyond". """ name: str = "Adam" - learning_rate: float = 0.001 beta_1: float = 0.9 beta_2: float = 0.999 epsilon: float = 1e-07 @@ -70,7 +87,6 @@ class AdamWeightDecayConfig(base_config.Config): Attributes: name: name of the optimizer. - learning_rate: learning_rate for the optimizer. beta_1: decay rate for 1st order moments. beta_2: decay rate for 2st order moments. epsilon: epsilon value used for numerical stability in the optimizer. @@ -83,7 +99,6 @@ class AdamWeightDecayConfig(base_config.Config): include in weight decay. """ name: str = "AdamWeightDecay" - learning_rate: float = 0.001 beta_1: float = 0.9 beta_2: float = 0.999 epsilon: float = 1e-07 @@ -102,7 +117,6 @@ class LAMBConfig(base_config.Config): Attributes: name: name of the optimizer. - learning_rate: learning_rate for Adam optimizer. beta_1: decay rate for 1st order moments. beta_2: decay rate for 2st order moments. epsilon: epsilon value used for numerical stability in LAMB optimizer. @@ -116,7 +130,6 @@ class LAMBConfig(base_config.Config): be excluded. """ name: str = "LAMB" - learning_rate: float = 0.001 beta_1: float = 0.9 beta_2: float = 0.999 epsilon: float = 1e-6 diff --git a/official/modeling/optimization/optimizer_factory.py b/official/modeling/optimization/optimizer_factory.py index 0988f6b3dd7ecc7b99e6f12e617aacba409d1fa3..c9ac04c42213c1a5904f162f369148ec43b0af82 100644 --- a/official/modeling/optimization/optimizer_factory.py +++ b/official/modeling/optimization/optimizer_factory.py @@ -14,7 +14,6 @@ # limitations under the License. 
# ============================================================================== """Optimizer factory class.""" - from typing import Union import tensorflow as tf @@ -29,7 +28,8 @@ OPTIMIZERS_CLS = { 'sgd': tf.keras.optimizers.SGD, 'adam': tf.keras.optimizers.Adam, 'adamw': nlp_optimization.AdamWeightDecay, - 'lamb': tfa_optimizers.LAMB + 'lamb': tfa_optimizers.LAMB, + 'rmsprop': tf.keras.optimizers.RMSprop } LR_CLS = { @@ -60,7 +60,7 @@ class OptimizerFactory(object): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} }, 'learning_rate': { 'type': 'stepwise', @@ -88,12 +88,15 @@ class OptimizerFactory(object): self._optimizer_config = config.optimizer.get() self._optimizer_type = config.optimizer.type - if self._optimizer_config is None: + if self._optimizer_type is None: raise ValueError('Optimizer type must be specified') self._lr_config = config.learning_rate.get() self._lr_type = config.learning_rate.type + if self._lr_type is None: + raise ValueError('Learning rate type must be specified') + self._warmup_config = config.warmup.get() self._warmup_type = config.warmup.type @@ -101,18 +104,15 @@ class OptimizerFactory(object): """Build learning rate. Builds learning rate from config. Learning rate schedule is built according - to the learning rate config. If there is no learning rate config, optimizer - learning rate is returned. + to the learning rate config. If learning rate type is constant, + lr_config.learning_rate is returned. Returns: - tf.keras.optimizers.schedules.LearningRateSchedule instance. If no - learning rate schedule defined, optimizer_config.learning_rate is - returned. + tf.keras.optimizers.schedules.LearningRateSchedule instance. If + learning rate type is constant, lr_config.learning_rate is returned. """ - - # TODO(arashwan): Explore if we want to only allow explicit const lr sched. 
- if not self._lr_config: - lr = self._optimizer_config.learning_rate + if self._lr_type == 'constant': + lr = self._lr_config.learning_rate else: lr = LR_CLS[self._lr_type](**self._lr_config.as_dict()) diff --git a/official/modeling/optimization/optimizer_factory_test.py b/official/modeling/optimization/optimizer_factory_test.py index d7ffa16cfaf3abcd3264f7144afd9e31c81bb272..b3218778528eea895fc83c4da59ad5bcccbfa655 100644 --- a/official/modeling/optimization/optimizer_factory_test.py +++ b/official/modeling/optimization/optimizer_factory_test.py @@ -15,91 +15,72 @@ # ============================================================================== """Tests for optimizer_factory.py.""" +from absl.testing import parameterized + import tensorflow as tf -import tensorflow_addons.optimizers as tfa_optimizers from official.modeling.optimization import optimizer_factory from official.modeling.optimization.configs import optimization_config -from official.nlp import optimization as nlp_optimization - - -class OptimizerFactoryTest(tf.test.TestCase): - - def test_sgd_optimizer(self): - params = { - 'optimizer': { - 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} - } - } - expected_optimizer_config = { - 'name': 'SGD', - 'learning_rate': 0.1, - 'decay': 0.0, - 'momentum': 0.9, - 'nesterov': False - } - opt_config = optimization_config.OptimizationConfig(params) - opt_factory = optimizer_factory.OptimizerFactory(opt_config) - lr = opt_factory.build_learning_rate() - optimizer = opt_factory.build_optimizer(lr) - self.assertIsInstance(optimizer, tf.keras.optimizers.SGD) - self.assertEqual(expected_optimizer_config, optimizer.get_config()) - def test_adam_optimizer(self): +class OptimizerFactoryTest(tf.test.TestCase, parameterized.TestCase): - # Define adam optimizer with default values. 
+ @parameterized.parameters( + ('sgd'), + ('rmsprop'), + ('adam'), + ('adamw'), + ('lamb')) + def test_optimizers(self, optimizer_type): params = { 'optimizer': { - 'type': 'adam' + 'type': optimizer_type + }, + 'learning_rate': { + 'type': 'constant', + 'constant': { + 'learning_rate': 0.1 + } } } - expected_optimizer_config = tf.keras.optimizers.Adam().get_config() + optimizer_cls = optimizer_factory.OPTIMIZERS_CLS[optimizer_type] + expected_optimizer_config = optimizer_cls().get_config() + expected_optimizer_config['learning_rate'] = 0.1 opt_config = optimization_config.OptimizationConfig(params) opt_factory = optimizer_factory.OptimizerFactory(opt_config) lr = opt_factory.build_learning_rate() optimizer = opt_factory.build_optimizer(lr) - self.assertIsInstance(optimizer, tf.keras.optimizers.Adam) + self.assertIsInstance(optimizer, optimizer_cls) self.assertEqual(expected_optimizer_config, optimizer.get_config()) - def test_adam_weight_decay_optimizer(self): + def test_missing_types(self): params = { 'optimizer': { - 'type': 'adamw' + 'type': 'sgd', + 'sgd': {'momentum': 0.9} } } - expected_optimizer_config = nlp_optimization.AdamWeightDecay().get_config() - opt_config = optimization_config.OptimizationConfig(params) - opt_factory = optimizer_factory.OptimizerFactory(opt_config) - lr = opt_factory.build_learning_rate() - optimizer = opt_factory.build_optimizer(lr) - - self.assertIsInstance(optimizer, nlp_optimization.AdamWeightDecay) - self.assertEqual(expected_optimizer_config, optimizer.get_config()) - - def test_lamb_optimizer(self): + with self.assertRaises(ValueError): + optimizer_factory.OptimizerFactory( + optimization_config.OptimizationConfig(params)) params = { - 'optimizer': { - 'type': 'lamb' + 'learning_rate': { + 'type': 'stepwise', + 'stepwise': {'boundaries': [10000, 20000], + 'values': [0.1, 0.01, 0.001]} } } - expected_optimizer_config = tfa_optimizers.LAMB().get_config() - opt_config = optimization_config.OptimizationConfig(params) - 
opt_factory = optimizer_factory.OptimizerFactory(opt_config) - lr = opt_factory.build_learning_rate() - optimizer = opt_factory.build_optimizer(lr) - - self.assertIsInstance(optimizer, tfa_optimizers.LAMB) - self.assertEqual(expected_optimizer_config, optimizer.get_config()) + with self.assertRaises(ValueError): + optimizer_factory.OptimizerFactory( + optimization_config.OptimizationConfig(params)) def test_stepwise_lr_schedule(self): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} }, 'learning_rate': { 'type': 'stepwise', @@ -126,7 +107,7 @@ class OptimizerFactoryTest(tf.test.TestCase): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} }, 'learning_rate': { 'type': 'stepwise', @@ -159,7 +140,7 @@ class OptimizerFactoryTest(tf.test.TestCase): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} }, 'learning_rate': { 'type': 'exponential', @@ -189,7 +170,7 @@ class OptimizerFactoryTest(tf.test.TestCase): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} }, 'learning_rate': { 'type': 'polynomial', @@ -213,7 +194,7 @@ class OptimizerFactoryTest(tf.test.TestCase): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} }, 'learning_rate': { 'type': 'cosine', @@ -239,7 +220,13 @@ class OptimizerFactoryTest(tf.test.TestCase): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} + }, + 'learning_rate': { + 'type': 'constant', + 'constant': { + 'learning_rate': 0.1 + } }, 'warmup': { 'type': 'linear', @@ -263,7 +250,7 @@ class OptimizerFactoryTest(tf.test.TestCase): params = { 'optimizer': { 'type': 'sgd', - 'sgd': {'learning_rate': 0.1, 'momentum': 0.9} + 'sgd': {'momentum': 0.9} }, 
'learning_rate': { 'type': 'stepwise', diff --git a/official/modeling/tf_utils.py b/official/modeling/tf_utils.py index 34f8f66e75733493d6e061b8f0b9571c1e038f6c..14b6a3f1f8f64635ee90facc1874e359a2d05229 100644 --- a/official/modeling/tf_utils.py +++ b/official/modeling/tf_utils.py @@ -88,7 +88,6 @@ def is_special_none_tensor(tensor): return tensor.shape.ndims == 0 and tensor.dtype == tf.int32 -# TODO(hongkuny): consider moving custom string-map lookup to keras api. def get_activation(identifier): """Maps a identifier to a Python function, e.g., "relu" => `tf.nn.relu`. @@ -173,3 +172,18 @@ def assert_rank(tensor, expected_rank, name=None): "For the tensor `%s`, the actual tensor rank `%d` (shape = %s) is not " "equal to the expected tensor rank `%s`" % (name, actual_rank, str(tensor.shape), str(expected_rank))) + + +def safe_mean(losses): + """Computes a safe mean of the losses. + + Args: + losses: `Tensor` whose elements contain individual loss measurements. + + Returns: + A scalar representing the mean of `losses`. If `num_present` is zero, + then zero is returned. + """ + total = tf.reduce_sum(losses) + num_elements = tf.cast(tf.size(losses), dtype=losses.dtype) + return tf.math.divide_no_nan(total, num_elements) diff --git a/official/modeling/training/distributed_executor.py b/official/modeling/training/distributed_executor.py index 11451260cdca52a9c9f4019010123c4d2b40e99e..4aeaa2b41d21704dadbe71510912d5ccab6b8be0 100644 --- a/official/modeling/training/distributed_executor.py +++ b/official/modeling/training/distributed_executor.py @@ -63,8 +63,8 @@ def metrics_as_dict(metric): """Puts input metric(s) into a list. Args: - metric: metric(s) to be put into the list. `metric` could be a object, a - list or a dict of tf.keras.metrics.Metric or has the `required_method`. + metric: metric(s) to be put into the list. `metric` could be an object, a + list, or a dict of tf.keras.metrics.Metric or has the `required_method`. Returns: A dictionary of valid metrics. 
@@ -351,7 +351,8 @@ class DistributedExecutor(object): train_input_fn: (params: dict) -> tf.data.Dataset training data input function. eval_input_fn: (Optional) same type as train_input_fn. If not None, will - trigger evaluting metric on eval data. If None, will not run eval step. + trigger evaluating metric on eval data. If None, will not run the eval + step. model_dir: the folder path for model checkpoints. total_steps: total training steps. iterations_per_loop: train steps per loop. After each loop, this job will @@ -672,7 +673,7 @@ class DistributedExecutor(object): raise ValueError('if `eval_metric_fn` is specified, ' 'eval_metric_fn must be a callable.') - old_phrase = tf.keras.backend.learning_phase() + old_phase = tf.keras.backend.learning_phase() tf.keras.backend.set_learning_phase(0) params = self._params strategy = self._strategy @@ -698,7 +699,8 @@ class DistributedExecutor(object): logging.info( 'Checkpoint file %s found and restoring from ' 'checkpoint', checkpoint_path) - checkpoint.restore(checkpoint_path) + status = checkpoint.restore(checkpoint_path) + status.expect_partial().assert_existing_objects_matched() self.global_train_step = model.optimizer.iterations eval_iterator = self._get_input_iterator(eval_input_fn, strategy) @@ -709,7 +711,7 @@ class DistributedExecutor(object): summary_writer(metrics=eval_metric_result, step=current_step) reset_states(eval_metric) - tf.keras.backend.set_learning_phase(old_phrase) + tf.keras.backend.set_learning_phase(old_phase) return eval_metric_result, current_step def predict(self): @@ -759,7 +761,7 @@ class ExecutorBuilder(object): Args: strategy_type: string. One of 'tpu', 'mirrored', 'multi_worker_mirrored'. - If None. User is responsible to set the strategy before calling + If None, the user is responsible for setting the strategy before calling build_executor(...). strategy_config: necessary config for constructing the proper Strategy. Check strategy_flags_dict() for examples of the structure. 
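The `safe_mean` helper added to `tf_utils.py` above computes `sum(losses) / size(losses)` through `tf.math.divide_no_nan`, so an empty loss tensor yields 0 instead of NaN. A plain-Python sketch of the same behavior (an illustration of the idea, not the TF implementation):

```python
def safe_mean(losses):
    """Mean of `losses` that returns 0.0 for an empty input instead of NaN.

    Mirrors the divide_no_nan semantics of the TF helper: total / count,
    with a zero denominator mapped to 0.0.
    """
    total = sum(losses)
    count = len(losses)
    return total / count if count else 0.0

mean = safe_mean([1.0, 2.0, 6.0])  # (1 + 2 + 6) / 3 = 3.0
empty = safe_mean([])              # 0.0, no ZeroDivisionError
```

The empty-input case matters when a replica receives a batch with no valid examples; the loss then contributes 0 to the cross-replica reduction instead of poisoning it with NaN.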
diff --git a/official/nlp/albert/run_classifier.py b/official/nlp/albert/run_classifier.py index fe72ff880f61c99e304bf089ef4ed0d75bfc349b..7b1371cc052775d3182c51a36926add43dee416e 100644 --- a/official/nlp/albert/run_classifier.py +++ b/official/nlp/albert/run_classifier.py @@ -14,23 +14,61 @@ # ============================================================================== """ALBERT classification finetuning runner in tf2.x.""" + from __future__ import absolute_import from __future__ import division from __future__ import print_function import json - +import os from absl import app from absl import flags +from absl import logging import tensorflow as tf from official.nlp.albert import configs as albert_configs +from official.nlp.bert import bert_models from official.nlp.bert import run_classifier as run_classifier_bert from official.utils.misc import distribution_utils + FLAGS = flags.FLAGS +def predict(strategy, albert_config, input_meta_data, predict_input_fn): + """Outputs both the ground truth and predictions as .tsv files.""" + with strategy.scope(): + classifier_model = bert_models.classifier_model( + albert_config, input_meta_data['num_labels'])[0] + checkpoint = tf.train.Checkpoint(model=classifier_model) + latest_checkpoint_file = ( + FLAGS.predict_checkpoint_path or + tf.train.latest_checkpoint(FLAGS.model_dir)) + assert latest_checkpoint_file + logging.info('Checkpoint file %s found and restoring from ' + 'checkpoint', latest_checkpoint_file) + checkpoint.restore( + latest_checkpoint_file).assert_existing_objects_matched() + preds, ground_truth = run_classifier_bert.get_predictions_and_labels( + strategy, classifier_model, predict_input_fn, return_probs=True) + output_predict_file = os.path.join(FLAGS.model_dir, 'test_results.tsv') + with tf.io.gfile.GFile(output_predict_file, 'w') as writer: + logging.info('***** Predict results *****') + for probabilities in preds: + output_line = '\t'.join( + str(class_probability) + for class_probability in 
probabilities) + '\n' + writer.write(output_line) + ground_truth_labels_file = os.path.join(FLAGS.model_dir, + 'output_labels.tsv') + with tf.io.gfile.GFile(ground_truth_labels_file, 'w') as writer: + logging.info('***** Ground truth results *****') + for label in ground_truth: + output_line = str(label) + '\n' + writer.write(output_line) + return + + + def main(_): with tf.io.gfile.GFile(FLAGS.input_meta_data_path, 'rb') as reader: input_meta_data = json.loads(reader.read().decode('utf-8')) @@ -56,9 +94,14 @@ def main(_): albert_config = albert_configs.AlbertConfig.from_json_file( FLAGS.bert_config_file) - run_classifier_bert.run_bert(strategy, input_meta_data, albert_config, - train_input_fn, eval_input_fn) - + if FLAGS.mode == 'train_and_eval': + run_classifier_bert.run_bert(strategy, input_meta_data, albert_config, + train_input_fn, eval_input_fn) + elif FLAGS.mode == 'predict': + predict(strategy, albert_config, input_meta_data, eval_input_fn) + else: + raise ValueError('Unsupported mode: %s' % FLAGS.mode) + return if __name__ == '__main__': flags.mark_flag_as_required('bert_config_file') diff --git a/official/nlp/albert/tf2_albert_encoder_checkpoint_converter.py b/official/nlp/albert/tf2_albert_encoder_checkpoint_converter.py index 402bc1445bed575362598d09212d14d03b629179..afd2ab19d6af157a24cf691b57c209d3dfd5f1fe 100644 --- a/official/nlp/albert/tf2_albert_encoder_checkpoint_converter.py +++ b/official/nlp/albert/tf2_albert_encoder_checkpoint_converter.py @@ -86,7 +86,7 @@ def _create_albert_model(cfg): activation=activations.gelu, dropout_rate=cfg.hidden_dropout_prob, attention_dropout_rate=cfg.attention_probs_dropout_prob, - sequence_length=cfg.max_position_embeddings, + max_sequence_length=cfg.max_position_embeddings, type_vocab_size=cfg.type_vocab_size, initializer=tf.keras.initializers.TruncatedNormal( stddev=cfg.initializer_range)) diff --git a/official/nlp/bert/bert_models.py b/official/nlp/bert/bert_models.py index 
e26c2a0caa0e0a3fe4881df137e1016614a39137..807c96581dce4118afae365364acae2b12f6415b 100644 --- a/official/nlp/bert/bert_models.py +++ b/official/nlp/bert/bert_models.py @@ -25,7 +25,6 @@ import tensorflow_hub as hub from official.modeling import tf_utils from official.nlp.albert import configs as albert_configs from official.nlp.bert import configs -from official.nlp.modeling import losses from official.nlp.modeling import models from official.nlp.modeling import networks @@ -67,22 +66,27 @@ class BertPretrainLossAndMetricLayer(tf.keras.layers.Layer): next_sentence_loss, name='next_sentence_loss', aggregation='mean') def call(self, - lm_output, - sentence_output, + lm_output_logits, + sentence_output_logits, lm_label_ids, lm_label_weights, sentence_labels=None): """Implements call() for the layer.""" lm_label_weights = tf.cast(lm_label_weights, tf.float32) - lm_output = tf.cast(lm_output, tf.float32) + lm_output_logits = tf.cast(lm_output_logits, tf.float32) - mask_label_loss = losses.weighted_sparse_categorical_crossentropy_loss( - labels=lm_label_ids, predictions=lm_output, weights=lm_label_weights) + lm_prediction_losses = tf.keras.losses.sparse_categorical_crossentropy( + lm_label_ids, lm_output_logits, from_logits=True) + lm_numerator_loss = tf.reduce_sum(lm_prediction_losses * lm_label_weights) + lm_denominator_loss = tf.reduce_sum(lm_label_weights) + mask_label_loss = tf.math.divide_no_nan(lm_numerator_loss, + lm_denominator_loss) if sentence_labels is not None: - sentence_output = tf.cast(sentence_output, tf.float32) - sentence_loss = losses.weighted_sparse_categorical_crossentropy_loss( - labels=sentence_labels, predictions=sentence_output) + sentence_output_logits = tf.cast(sentence_output_logits, tf.float32) + sentence_loss = tf.keras.losses.sparse_categorical_crossentropy( + sentence_labels, sentence_output_logits, from_logits=True) + sentence_loss = tf.reduce_mean(sentence_loss) loss = mask_label_loss + sentence_loss else: sentence_loss = None @@ -92,22 
+96,22 @@ class BertPretrainLossAndMetricLayer(tf.keras.layers.Layer): # TODO(hongkuny): Avoids the hack and switches add_loss. final_loss = tf.fill(batch_shape, loss) - self._add_metrics(lm_output, lm_label_ids, lm_label_weights, - mask_label_loss, sentence_output, sentence_labels, + self._add_metrics(lm_output_logits, lm_label_ids, lm_label_weights, + mask_label_loss, sentence_output_logits, sentence_labels, sentence_loss) return final_loss @gin.configurable def get_transformer_encoder(bert_config, - sequence_length, + sequence_length=None, transformer_encoder_cls=None, output_range=None): """Gets a 'TransformerEncoder' object. Args: bert_config: A 'modeling.BertConfig' or 'modeling.AlbertConfig' object. - sequence_length: Maximum sequence length of the training data. + sequence_length: [Deprecated]. transformer_encoder_cls: A EncoderScaffold class. If it is None, uses the default BERT encoder implementation. output_range: the sequence output range, [0, output_range). Default setting @@ -116,13 +120,13 @@ def get_transformer_encoder(bert_config, Returns: A networks.TransformerEncoder object. """ + del sequence_length if transformer_encoder_cls is not None: # TODO(hongkuny): evaluate if it is better to put cfg definition in gin. 
embedding_cfg = dict( vocab_size=bert_config.vocab_size, type_vocab_size=bert_config.type_vocab_size, hidden_size=bert_config.hidden_size, - seq_length=sequence_length, max_seq_length=bert_config.max_position_embeddings, initializer=tf.keras.initializers.TruncatedNormal( stddev=bert_config.initializer_range), @@ -157,7 +161,6 @@ def get_transformer_encoder(bert_config, activation=tf_utils.get_activation(bert_config.hidden_act), dropout_rate=bert_config.hidden_dropout_prob, attention_dropout_rate=bert_config.attention_probs_dropout_prob, - sequence_length=sequence_length, max_sequence_length=bert_config.max_position_embeddings, type_vocab_size=bert_config.type_vocab_size, embedding_width=bert_config.embedding_size, @@ -228,7 +231,7 @@ def pretrain_model(bert_config, activation=tf_utils.get_activation(bert_config.hidden_act), num_token_predictions=max_predictions_per_seq, initializer=initializer, - output='predictions') + output='logits') outputs = pretrainer_model( [input_word_ids, input_mask, input_type_ids, masked_lm_positions]) diff --git a/official/nlp/bert/bert_models_test.py b/official/nlp/bert/bert_models_test.py index 93763b45bfc53c5d32de2df7f7f0f72894e9556f..0c6e3ec43b55db1bd3a53754cf176c0db8cfadf1 100644 --- a/official/nlp/bert/bert_models_test.py +++ b/official/nlp/bert/bert_models_test.py @@ -56,8 +56,6 @@ class BertModelsTest(tf.test.TestCase): # Expect two output from encoder: sequence and classification output. 
self.assertIsInstance(encoder.output, list) self.assertLen(encoder.output, 2) - # shape should be [batch size, seq_length, hidden_size] - self.assertEqual(encoder.output[0].shape.as_list(), [None, 5, 16]) # shape should be [batch size, hidden_size] self.assertEqual(encoder.output[1].shape.as_list(), [None, 16]) @@ -74,16 +72,12 @@ class BertModelsTest(tf.test.TestCase): # Expect two output from model: start positions and end positions self.assertIsInstance(model.output, list) self.assertLen(model.output, 2) - # shape should be [batch size, seq_length] - self.assertEqual(model.output[0].shape.as_list(), [None, 5]) - # shape should be [batch size, seq_length] - self.assertEqual(model.output[1].shape.as_list(), [None, 5]) # Expect two output from core_model: sequence and classification output. self.assertIsInstance(core_model.output, list) self.assertLen(core_model.output, 2) - # shape should be [batch size, seq_length, hidden_size] - self.assertEqual(core_model.output[0].shape.as_list(), [None, 5, 16]) + # shape should be [batch size, None, hidden_size] + self.assertEqual(core_model.output[0].shape.as_list(), [None, None, 16]) # shape should be [batch size, hidden_size] self.assertEqual(core_model.output[1].shape.as_list(), [None, 16]) @@ -104,8 +98,8 @@ class BertModelsTest(tf.test.TestCase): # Expect two output from core_model: sequence and classification output. 
self.assertIsInstance(core_model.output, list) self.assertLen(core_model.output, 2) - # shape should be [batch size, 1, hidden_size] - self.assertEqual(core_model.output[0].shape.as_list(), [None, 1, 16]) + # shape should be [batch size, None, hidden_size] + self.assertEqual(core_model.output[0].shape.as_list(), [None, None, 16]) # shape should be [batch size, hidden_size] self.assertEqual(core_model.output[1].shape.as_list(), [None, 16]) diff --git a/official/nlp/bert/export_tfhub.py b/official/nlp/bert/export_tfhub.py index 5923309d1fa36a16d4cccda11650d9c3d0fcc616..5a49a3df54a64ceacbe1235b870d17bc84d8a488 100644 --- a/official/nlp/bert/export_tfhub.py +++ b/official/nlp/bert/export_tfhub.py @@ -79,7 +79,7 @@ def export_bert_tfhub(bert_config: configs.BertConfig, do_lower_case, vocab_file) core_model, encoder = create_bert_model(bert_config) checkpoint = tf.train.Checkpoint(model=encoder) - checkpoint.restore(model_checkpoint_path).assert_consumed() + checkpoint.restore(model_checkpoint_path).assert_existing_objects_matched() core_model.vocab_file = tf.saved_model.Asset(vocab_file) core_model.do_lower_case = tf.Variable(do_lower_case, trainable=False) core_model.save(hub_destination, include_optimizer=False, save_format="tf") diff --git a/official/nlp/bert/input_pipeline.py b/official/nlp/bert/input_pipeline.py index 73c2a096ef6cf71b64929f78d5fdee33b9a8692f..ed3fd173d4379a75ab1e2e5a9ba0bbdcbaa0be42 100644 --- a/official/nlp/bert/input_pipeline.py +++ b/official/nlp/bert/input_pipeline.py @@ -247,3 +247,39 @@ def create_squad_dataset(file_path, dataset = dataset.batch(batch_size, drop_remainder=True) dataset = dataset.prefetch(tf.data.experimental.AUTOTUNE) return dataset + + +def create_retrieval_dataset(file_path, + seq_length, + batch_size, + input_pipeline_context=None): + """Creates input dataset from (tf)records files for scoring.""" + name_to_features = { + 'input_ids': tf.io.FixedLenFeature([seq_length], tf.int64), + 'input_mask': 
tf.io.FixedLenFeature([seq_length], tf.int64), + 'segment_ids': tf.io.FixedLenFeature([seq_length], tf.int64), + 'int_iden': tf.io.FixedLenFeature([1], tf.int64), + } + dataset = single_file_dataset(file_path, name_to_features) + + # The dataset is always sharded by number of hosts. + # num_input_pipelines is the number of hosts rather than number of cores. + if input_pipeline_context and input_pipeline_context.num_input_pipelines > 1: + dataset = dataset.shard(input_pipeline_context.num_input_pipelines, + input_pipeline_context.input_pipeline_id) + + def _select_data_from_record(record): + x = { + 'input_word_ids': record['input_ids'], + 'input_mask': record['input_mask'], + 'input_type_ids': record['segment_ids'] + } + y = record['int_iden'] + return (x, y) + + dataset = dataset.map( + _select_data_from_record, + num_parallel_calls=tf.data.experimental.AUTOTUNE) + dataset = dataset.batch(batch_size, drop_remainder=False) + dataset = dataset.prefetch(tf.data.experimental.AUTOTUNE) + return dataset diff --git a/official/nlp/bert/model_saving_utils.py b/official/nlp/bert/model_saving_utils.py index 13d2c9ed02f9a98d9dcbb2a60c46fa5cd13bb666..24e39c6e4af02757d81dcc380612148da5891ac5 100644 --- a/official/nlp/bert/model_saving_utils.py +++ b/official/nlp/bert/model_saving_utils.py @@ -55,14 +55,10 @@ def export_bert_model(model_export_path: typing.Text, raise ValueError('model must be a tf.keras.Model object.') if checkpoint_dir: - # Keras compile/fit() was used to save checkpoint using - # model.save_weights(). if restore_model_using_load_weights: model_weight_path = os.path.join(checkpoint_dir, 'checkpoint') assert tf.io.gfile.exists(model_weight_path) model.load_weights(model_weight_path) - - # tf.train.Checkpoint API was used via custom training loop logic. 
else: checkpoint = tf.train.Checkpoint(model=model) diff --git a/official/nlp/bert/model_training_utils.py b/official/nlp/bert/model_training_utils.py index f0fe67615726906a6b1d3ef38a5ca9acfe8502de..071e18b3453a7291fd4ece111811ac1e1243a5cd 100644 --- a/official/nlp/bert/model_training_utils.py +++ b/official/nlp/bert/model_training_utils.py @@ -99,7 +99,9 @@ def write_txt_summary(training_summary, summary_dir): @deprecation.deprecated( - None, 'This function is deprecated. Please use Keras compile/fit instead.') + None, 'This function is deprecated and no new functionality will ' + 'be added to it. Please avoid making new code depend ' + 'on this library.') def run_customized_training_loop( # pylint: disable=invalid-name _sentinel=None, @@ -557,7 +559,6 @@ def run_customized_training_loop( for metric in model.metrics: training_summary[metric.name] = _float_metric_value(metric) if eval_metrics: - # TODO(hongkuny): Cleans up summary reporting in text. training_summary['last_train_metrics'] = _float_metric_value( train_metrics[0]) training_summary['eval_metrics'] = _float_metric_value(eval_metrics[0]) diff --git a/official/nlp/bert/run_classifier.py b/official/nlp/bert/run_classifier.py index e2eb525ae4335091c78eb4ead72494f8021a7f89..c5f3721ada6279f7446ec0d21ce1eeae549afcd8 100644 --- a/official/nlp/bert/run_classifier.py +++ b/official/nlp/bert/run_classifier.py @@ -343,7 +343,10 @@ def export_classifier(model_export_path, input_meta_data, bert_config, # Export uses float32 for now, even if training uses mixed precision. 
tf.keras.mixed_precision.experimental.set_policy('float32') classifier_model = bert_models.classifier_model( - bert_config, input_meta_data.get('num_labels', 1))[0] + bert_config, + input_meta_data.get('num_labels', 1), + hub_module_url=FLAGS.hub_module_url, + hub_module_trainable=False)[0] model_saving_utils.export_bert_model( model_export_path, model=classifier_model, checkpoint_dir=model_dir) diff --git a/official/nlp/bert/run_squad_helper.py b/official/nlp/bert/run_squad_helper.py index 7f6ea5bbbe2ae2fb6af89f139da989c82b1f893d..b03e356d91bdf6a9edf9486f505526852c6c7ef6 100644 --- a/official/nlp/bert/run_squad_helper.py +++ b/official/nlp/bert/run_squad_helper.py @@ -61,7 +61,11 @@ def define_common_squad_flags(): flags.DEFINE_integer('train_batch_size', 32, 'Total batch size for training.') # Predict processing related. flags.DEFINE_string('predict_file', None, - 'Prediction data path with train tfrecords.') + 'SQuAD prediction json file path. ' + '`predict` mode supports multiple files: use a ' + 'wildcard to match multiple files, or pass multiple ' + 'comma-separated file patterns. Note that ' + '`eval` mode only supports a single predict file.') flags.DEFINE_bool( 'do_lower_case', True, 'Whether to lower case the input text. 
Should be True for uncased ' @@ -159,22 +163,9 @@ def get_dataset_fn(input_file_pattern, max_seq_length, global_batch_size, return _dataset_fn -def predict_squad_customized(strategy, - input_meta_data, - bert_config, - checkpoint_path, - predict_tfrecord_path, - num_steps): - """Make predictions using a Bert-based squad model.""" - predict_dataset_fn = get_dataset_fn( - predict_tfrecord_path, - input_meta_data['max_seq_length'], - FLAGS.predict_batch_size, - is_training=False) - predict_iterator = iter( - strategy.experimental_distribute_datasets_from_function( - predict_dataset_fn)) - +def get_squad_model_to_predict(strategy, bert_config, checkpoint_path, + input_meta_data): + """Gets a squad model to make predictions.""" with strategy.scope(): # Prediction always uses float32, even if training uses mixed precision. tf.keras.mixed_precision.experimental.set_policy('float32') @@ -188,6 +179,23 @@ def predict_squad_customized(strategy, logging.info('Restoring checkpoints from %s', checkpoint_path) checkpoint = tf.train.Checkpoint(model=squad_model) checkpoint.restore(checkpoint_path).expect_partial() + return squad_model + + +def predict_squad_customized(strategy, + input_meta_data, + predict_tfrecord_path, + num_steps, + squad_model): + """Make predictions using a Bert-based squad model.""" + predict_dataset_fn = get_dataset_fn( + predict_tfrecord_path, + input_meta_data['max_seq_length'], + FLAGS.predict_batch_size, + is_training=False) + predict_iterator = iter( + strategy.experimental_distribute_datasets_from_function( + predict_dataset_fn)) @tf.function def predict_step(iterator): @@ -287,8 +295,8 @@ def train_squad(strategy, post_allreduce_callbacks=[clip_by_global_norm_callback]) -def prediction_output_squad( - strategy, input_meta_data, tokenizer, bert_config, squad_lib, checkpoint): +def prediction_output_squad(strategy, input_meta_data, tokenizer, squad_lib, + predict_file, squad_model): """Makes predictions for a squad dataset.""" doc_stride = 
input_meta_data['doc_stride'] max_query_length = input_meta_data['max_query_length'] @@ -296,7 +304,7 @@ version_2_with_negative = input_meta_data.get('version_2_with_negative', False) eval_examples = squad_lib.read_squad_examples( - input_file=FLAGS.predict_file, + input_file=predict_file, is_training=False, version_2_with_negative=version_2_with_negative) @@ -337,8 +345,7 @@ num_steps = int(dataset_size / FLAGS.predict_batch_size) all_results = predict_squad_customized( - strategy, input_meta_data, bert_config, - checkpoint, eval_writer.filename, num_steps) + strategy, input_meta_data, eval_writer.filename, num_steps, squad_model) all_predictions, all_nbest_json, scores_diff_json = ( squad_lib.postprocess_output( @@ -356,11 +363,14 @@ def dump_to_files(all_predictions, all_nbest_json, scores_diff_json, - squad_lib, version_2_with_negative): + squad_lib, version_2_with_negative, file_prefix=''): """Save output to json files.""" - output_prediction_file = os.path.join(FLAGS.model_dir, 'predictions.json') - output_nbest_file = os.path.join(FLAGS.model_dir, 'nbest_predictions.json') - output_null_log_odds_file = os.path.join(FLAGS.model_dir, 'null_odds.json') + output_prediction_file = os.path.join(FLAGS.model_dir, + '%spredictions.json' % file_prefix) + output_nbest_file = os.path.join(FLAGS.model_dir, + '%snbest_predictions.json' % file_prefix) + output_null_log_odds_file = os.path.join(FLAGS.model_dir, + '%snull_odds.json' % file_prefix) logging.info('Writing predictions to: %s', (output_prediction_file)) logging.info('Writing nbest to: %s', (output_nbest_file)) @@ -370,6 +380,22 @@ def dump_to_files(all_predictions, all_nbest_json, scores_diff_json, squad_lib.write_to_json_files(scores_diff_json, output_null_log_odds_file) +def _get_matched_files(input_path): + """Returns all files that match the input_path.""" + input_patterns = input_path.strip().split(',') + 
all_matched_files = [] + for input_pattern in input_patterns: + input_pattern = input_pattern.strip() + if not input_pattern: + continue + matched_files = tf.io.gfile.glob(input_pattern) + if not matched_files: + raise ValueError('%s does not match any files.' % input_pattern) + else: + all_matched_files.extend(matched_files) + return sorted(all_matched_files) + + + def predict_squad(strategy, input_meta_data, tokenizer, @@ -379,11 +405,24 @@ """Gets prediction results and writes them to disk.""" if init_checkpoint is None: init_checkpoint = tf.train.latest_checkpoint(FLAGS.model_dir) - all_predictions, all_nbest_json, scores_diff_json = prediction_output_squad( - strategy, input_meta_data, tokenizer, - bert_config, squad_lib, init_checkpoint) - dump_to_files(all_predictions, all_nbest_json, scores_diff_json, squad_lib, - input_meta_data.get('version_2_with_negative', False)) + + all_predict_files = _get_matched_files(FLAGS.predict_file) + squad_model = get_squad_model_to_predict(strategy, bert_config, + init_checkpoint, input_meta_data) + for idx, predict_file in enumerate(all_predict_files): + all_predictions, all_nbest_json, scores_diff_json = prediction_output_squad( + strategy, input_meta_data, tokenizer, squad_lib, predict_file, + squad_model) + if len(all_predict_files) == 1: + file_prefix = '' + else: + # if predict_file is /path/xquad.ar.json, the `file_prefix` will be + # "xquad.ar-" + file_prefix = '%s-' % os.path.splitext( + os.path.basename(all_predict_files[idx]))[0] + dump_to_files(all_predictions, all_nbest_json, scores_diff_json, squad_lib, + input_meta_data.get('version_2_with_negative', False), + file_prefix) def eval_squad(strategy, @@ -395,9 +434,17 @@ """Get prediction results and evaluate them against ground truth.""" if init_checkpoint is None: init_checkpoint = tf.train.latest_checkpoint(FLAGS.model_dir) + + all_predict_files = _get_matched_files(FLAGS.predict_file) + if 
len(all_predict_files) != 1: + raise ValueError('`eval_squad` only supports one predict file, ' + 'but got %s' % all_predict_files) + + squad_model = get_squad_model_to_predict(strategy, bert_config, + init_checkpoint, input_meta_data) all_predictions, all_nbest_json, scores_diff_json = prediction_output_squad( - strategy, input_meta_data, tokenizer, - bert_config, squad_lib, init_checkpoint) + strategy, input_meta_data, tokenizer, squad_lib, all_predict_files[0], + squad_model) dump_to_files(all_predictions, all_nbest_json, scores_diff_json, squad_lib, input_meta_data.get('version_2_with_negative', False)) diff --git a/official/nlp/bert/tf2_encoder_checkpoint_converter.py b/official/nlp/bert/tf2_encoder_checkpoint_converter.py index 2faf6ea2cfb9f0d71d0a79dff101e0408fa41778..835a152f7ca54c32200b2aed6481a546cab366dc 100644 --- a/official/nlp/bert/tf2_encoder_checkpoint_converter.py +++ b/official/nlp/bert/tf2_encoder_checkpoint_converter.py @@ -61,7 +61,7 @@ def _create_bert_model(cfg): activation=activations.gelu, dropout_rate=cfg.hidden_dropout_prob, attention_dropout_rate=cfg.attention_probs_dropout_prob, - sequence_length=cfg.max_position_embeddings, + max_sequence_length=cfg.max_position_embeddings, type_vocab_size=cfg.type_vocab_size, initializer=tf.keras.initializers.TruncatedNormal( stddev=cfg.initializer_range), @@ -73,6 +73,7 @@ def _create_bert_model(cfg): def convert_checkpoint(bert_config, output_path, v1_checkpoint): """Converts a V1 checkpoint into an OO V2 checkpoint.""" output_dir, _ = os.path.split(output_path) + tf.io.gfile.makedirs(output_dir) # Create a temporary V1 name-converted checkpoint in the output directory. 
temporary_checkpoint_dir = os.path.join(output_dir, "temp_v1") diff --git a/official/nlp/configs/bert.py b/official/nlp/configs/bert.py index 058af898f51c99ccf35114b5bff480995b8a580d..fad49e29debd0864448b00899725b55101c8f293 100644 --- a/official/nlp/configs/bert.py +++ b/official/nlp/configs/bert.py @@ -13,7 +13,10 @@ # See the License for the specific language governing permissions and # limitations under the License. # ============================================================================== -"""A multi-head BERT encoder network for pretraining.""" +"""Multi-head BERT encoder network with classification heads. + +Includes configurations and instantiation methods. +""" from typing import List, Optional, Text import dataclasses @@ -21,10 +24,8 @@ import tensorflow as tf from official.modeling import tf_utils from official.modeling.hyperparams import base_config -from official.modeling.hyperparams import config_definitions as cfg from official.nlp.configs import encoders from official.nlp.modeling import layers -from official.nlp.modeling import networks from official.nlp.modeling.models import bert_pretrainer @@ -41,80 +42,30 @@ class ClsHeadConfig(base_config.Config): @dataclasses.dataclass class BertPretrainerConfig(base_config.Config): """BERT encoder configuration.""" - num_masked_tokens: int = 76 encoder: encoders.TransformerEncoderConfig = ( encoders.TransformerEncoderConfig()) cls_heads: List[ClsHeadConfig] = dataclasses.field(default_factory=list) -def instantiate_from_cfg( +def instantiate_classification_heads_from_cfgs( + cls_head_configs: List[ClsHeadConfig]) -> List[layers.ClassificationHead]: + return [ + layers.ClassificationHead(**cfg.as_dict()) for cfg in cls_head_configs + ] if cls_head_configs else [] + + +def instantiate_pretrainer_from_cfg( config: BertPretrainerConfig, - encoder_network: Optional[tf.keras.Model] = None): + encoder_network: Optional[tf.keras.Model] = None +) -> bert_pretrainer.BertPretrainerV2: """Instantiates a 
BertPretrainer from the config.""" encoder_cfg = config.encoder if encoder_network is None: - encoder_network = networks.TransformerEncoder( - vocab_size=encoder_cfg.vocab_size, - hidden_size=encoder_cfg.hidden_size, - num_layers=encoder_cfg.num_layers, - num_attention_heads=encoder_cfg.num_attention_heads, - intermediate_size=encoder_cfg.intermediate_size, - activation=tf_utils.get_activation(encoder_cfg.hidden_activation), - dropout_rate=encoder_cfg.dropout_rate, - attention_dropout_rate=encoder_cfg.attention_dropout_rate, - max_sequence_length=encoder_cfg.max_position_embeddings, - type_vocab_size=encoder_cfg.type_vocab_size, - initializer=tf.keras.initializers.TruncatedNormal( - stddev=encoder_cfg.initializer_range)) - if config.cls_heads: - classification_heads = [ - layers.ClassificationHead(**cfg.as_dict()) for cfg in config.cls_heads - ] - else: - classification_heads = [] + encoder_network = encoders.instantiate_encoder_from_cfg(encoder_cfg) return bert_pretrainer.BertPretrainerV2( - config.num_masked_tokens, mlm_activation=tf_utils.get_activation(encoder_cfg.hidden_activation), mlm_initializer=tf.keras.initializers.TruncatedNormal( stddev=encoder_cfg.initializer_range), encoder_network=encoder_network, - classification_heads=classification_heads) - - -@dataclasses.dataclass -class BertPretrainDataConfig(cfg.DataConfig): - """Data config for BERT pretraining task.""" - input_path: str = "" - global_batch_size: int = 512 - is_training: bool = True - seq_length: int = 512 - max_predictions_per_seq: int = 76 - use_next_sentence_label: bool = True - use_position_id: bool = False - - -@dataclasses.dataclass -class BertPretrainEvalDataConfig(BertPretrainDataConfig): - """Data config for the eval set in BERT pretraining task.""" - input_path: str = "" - global_batch_size: int = 512 - is_training: bool = False - - -@dataclasses.dataclass -class BertSentencePredictionDataConfig(cfg.DataConfig): - """Data of sentence prediction dataset.""" - input_path: str = "" - 
global_batch_size: int = 32 - is_training: bool = True - seq_length: int = 128 - - -@dataclasses.dataclass -class BertSentencePredictionDevDataConfig(cfg.DataConfig): - """Dev data of MNLI sentence prediction dataset.""" - input_path: str = "" - global_batch_size: int = 32 - is_training: bool = False - seq_length: int = 128 - drop_remainder: bool = False + classification_heads=instantiate_classification_heads_from_cfgs( + config.cls_heads)) diff --git a/official/nlp/configs/bert_test.py b/official/nlp/configs/bert_test.py index 199608cd05ab6a83d92edbcf5154aa7b33c8dfd0..871ab45373c430667f2cf45f93492947aaa3c4e9 100644 --- a/official/nlp/configs/bert_test.py +++ b/official/nlp/configs/bert_test.py @@ -26,7 +26,7 @@ class BertModelsTest(tf.test.TestCase): def test_network_invocation(self): config = bert.BertPretrainerConfig( encoder=encoders.TransformerEncoderConfig(vocab_size=10, num_layers=1)) - _ = bert.instantiate_from_cfg(config) + _ = bert.instantiate_pretrainer_from_cfg(config) # Invokes with classification heads. 
config = bert.BertPretrainerConfig( @@ -35,7 +35,7 @@ class BertModelsTest(tf.test.TestCase): bert.ClsHeadConfig( inner_dim=10, num_classes=2, name="next_sentence") ]) - _ = bert.instantiate_from_cfg(config) + _ = bert.instantiate_pretrainer_from_cfg(config) with self.assertRaises(ValueError): config = bert.BertPretrainerConfig( @@ -47,7 +47,7 @@ class BertModelsTest(tf.test.TestCase): bert.ClsHeadConfig( inner_dim=10, num_classes=2, name="next_sentence") ]) - _ = bert.instantiate_from_cfg(config) + _ = bert.instantiate_pretrainer_from_cfg(config) def test_checkpoint_items(self): config = bert.BertPretrainerConfig( @@ -56,9 +56,10 @@ class BertModelsTest(tf.test.TestCase): bert.ClsHeadConfig( inner_dim=10, num_classes=2, name="next_sentence") ]) - encoder = bert.instantiate_from_cfg(config) - self.assertSameElements(encoder.checkpoint_items.keys(), - ["encoder", "next_sentence.pooler_dense"]) + encoder = bert.instantiate_pretrainer_from_cfg(config) + self.assertSameElements( + encoder.checkpoint_items.keys(), + ["encoder", "masked_lm", "next_sentence.pooler_dense"]) if __name__ == "__main__": diff --git a/official/nlp/configs/electra.py b/official/nlp/configs/electra.py new file mode 100644 index 0000000000000000000000000000000000000000..61fd82db702364ffe6baf8fad1c8b3ae17d09120 --- /dev/null +++ b/official/nlp/configs/electra.py @@ -0,0 +1,91 @@ +# Lint as: python3 +# Copyright 2020 The TensorFlow Authors. All Rights Reserved. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 
+# See the License for the specific language governing permissions and +# limitations under the License. +# ============================================================================== +"""ELECTRA model configurations and instantiation methods.""" +from typing import List, Optional + +import dataclasses +import tensorflow as tf + +from official.modeling import tf_utils +from official.modeling.hyperparams import base_config +from official.nlp.configs import bert +from official.nlp.configs import encoders +from official.nlp.modeling import layers +from official.nlp.modeling.models import electra_pretrainer + + +@dataclasses.dataclass +class ELECTRAPretrainerConfig(base_config.Config): + """ELECTRA pretrainer configuration.""" + num_masked_tokens: int = 76 + sequence_length: int = 512 + num_classes: int = 2 + discriminator_loss_weight: float = 50.0 + tie_embeddings: bool = True + disallow_correct: bool = False + generator_encoder: encoders.TransformerEncoderConfig = ( + encoders.TransformerEncoderConfig()) + discriminator_encoder: encoders.TransformerEncoderConfig = ( + encoders.TransformerEncoderConfig()) + cls_heads: List[bert.ClsHeadConfig] = dataclasses.field(default_factory=list) + + +def instantiate_classification_heads_from_cfgs( + cls_head_configs: List[bert.ClsHeadConfig] +) -> List[layers.ClassificationHead]: + if cls_head_configs: + return [ + layers.ClassificationHead(**cfg.as_dict()) for cfg in cls_head_configs + ] + else: + return [] + + +def instantiate_pretrainer_from_cfg( + config: ELECTRAPretrainerConfig, + generator_network: Optional[tf.keras.Model] = None, + discriminator_network: Optional[tf.keras.Model] = None, + ) -> electra_pretrainer.ElectraPretrainer: + """Instantiates ElectraPretrainer from the config.""" + generator_encoder_cfg = config.generator_encoder + discriminator_encoder_cfg = config.discriminator_encoder + # Copy discriminator's embeddings to generator for easier model serialization. 
+ if discriminator_network is None: + discriminator_network = encoders.instantiate_encoder_from_cfg( + discriminator_encoder_cfg) + if generator_network is None: + if config.tie_embeddings: + embedding_layer = discriminator_network.get_embedding_layer() + generator_network = encoders.instantiate_encoder_from_cfg( + generator_encoder_cfg, embedding_layer=embedding_layer) + else: + generator_network = encoders.instantiate_encoder_from_cfg( + generator_encoder_cfg) + + return electra_pretrainer.ElectraPretrainer( + generator_network=generator_network, + discriminator_network=discriminator_network, + vocab_size=config.generator_encoder.vocab_size, + num_classes=config.num_classes, + sequence_length=config.sequence_length, + num_token_predictions=config.num_masked_tokens, + mlm_activation=tf_utils.get_activation( + generator_encoder_cfg.hidden_activation), + mlm_initializer=tf.keras.initializers.TruncatedNormal( + stddev=generator_encoder_cfg.initializer_range), + classification_heads=instantiate_classification_heads_from_cfgs( + config.cls_heads), + disallow_correct=config.disallow_correct) diff --git a/official/nlp/configs/electra_test.py b/official/nlp/configs/electra_test.py new file mode 100644 index 0000000000000000000000000000000000000000..d06d64a95d6ef987cdb34a471521853001f11339 --- /dev/null +++ b/official/nlp/configs/electra_test.py @@ -0,0 +1,49 @@ +# Lint as: python3 +# Copyright 2020 The TensorFlow Authors. All Rights Reserved. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ==============================================================================
+"""Tests for ELECTRA configurations and models instantiation."""
+
+import tensorflow as tf
+
+from official.nlp.configs import bert
+from official.nlp.configs import electra
+from official.nlp.configs import encoders
+
+
+class ELECTRAModelsTest(tf.test.TestCase):
+
+  def test_network_invocation(self):
+    config = electra.ELECTRAPretrainerConfig(
+        generator_encoder=encoders.TransformerEncoderConfig(
+            vocab_size=10, num_layers=1),
+        discriminator_encoder=encoders.TransformerEncoderConfig(
+            vocab_size=10, num_layers=2),
+    )
+    _ = electra.instantiate_pretrainer_from_cfg(config)
+
+    # Invokes with classification heads.
+    config = electra.ELECTRAPretrainerConfig(
+        generator_encoder=encoders.TransformerEncoderConfig(
+            vocab_size=10, num_layers=1),
+        discriminator_encoder=encoders.TransformerEncoderConfig(
+            vocab_size=10, num_layers=2),
+        cls_heads=[
+            bert.ClsHeadConfig(
+                inner_dim=10, num_classes=2, name="next_sentence")
+        ])
+    _ = electra.instantiate_pretrainer_from_cfg(config)
+
+if __name__ == "__main__":
+  tf.test.main()
diff --git a/official/nlp/configs/encoders.py b/official/nlp/configs/encoders.py
index 146879a9552fb8177734f7eebb4e49437cfb4d3e..b7467634a36adf72952481faacbce4852cd7feb7 100644
--- a/official/nlp/configs/encoders.py
+++ b/official/nlp/configs/encoders.py
@@ -13,11 +13,18 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 # ==============================================================================
-"""Configurations for Encoders."""
+"""Transformer Encoders.
+Includes configurations and instantiation methods.
+"""
+from typing import Optional
 import dataclasses
+import tensorflow as tf
+from official.modeling import tf_utils
 from official.modeling.hyperparams import base_config
+from official.nlp.modeling import layers
+from official.nlp.modeling import networks
 
 
 @dataclasses.dataclass
@@ -28,9 +35,64 @@ class TransformerEncoderConfig(base_config.Config):
   num_layers: int = 12
   num_attention_heads: int = 12
   hidden_activation: str = "gelu"
-  intermediate_size: int = 3076
+  intermediate_size: int = 3072
   dropout_rate: float = 0.1
   attention_dropout_rate: float = 0.1
   max_position_embeddings: int = 512
   type_vocab_size: int = 2
   initializer_range: float = 0.02
+  embedding_size: Optional[int] = None
+
+
+def instantiate_encoder_from_cfg(
+    config: TransformerEncoderConfig,
+    encoder_cls=networks.TransformerEncoder,
+    embedding_layer: Optional[layers.OnDeviceEmbedding] = None):
+  """Instantiate a Transformer encoder network from TransformerEncoderConfig."""
+  if encoder_cls.__name__ == "EncoderScaffold":
+    embedding_cfg = dict(
+        vocab_size=config.vocab_size,
+        type_vocab_size=config.type_vocab_size,
+        hidden_size=config.hidden_size,
+        max_seq_length=config.max_position_embeddings,
+        initializer=tf.keras.initializers.TruncatedNormal(
+            stddev=config.initializer_range),
+        dropout_rate=config.dropout_rate,
+    )
+    hidden_cfg = dict(
+        num_attention_heads=config.num_attention_heads,
+        intermediate_size=config.intermediate_size,
+        intermediate_activation=tf_utils.get_activation(
+            config.hidden_activation),
+        dropout_rate=config.dropout_rate,
+        attention_dropout_rate=config.attention_dropout_rate,
+        kernel_initializer=tf.keras.initializers.TruncatedNormal(
+            stddev=config.initializer_range),
+    )
+    kwargs = dict(
+        embedding_cfg=embedding_cfg,
+        hidden_cfg=hidden_cfg,
+        num_hidden_instances=config.num_layers,
+        pooled_output_dim=config.hidden_size,
+        pooler_layer_initializer=tf.keras.initializers.TruncatedNormal(
+            stddev=config.initializer_range))
+    return encoder_cls(**kwargs)
+
+  if encoder_cls.__name__ != "TransformerEncoder":
+    raise ValueError("Unknown encoder network class. %s" % str(encoder_cls))
+  encoder_network = encoder_cls(
+      vocab_size=config.vocab_size,
+      hidden_size=config.hidden_size,
+      num_layers=config.num_layers,
+      num_attention_heads=config.num_attention_heads,
+      intermediate_size=config.intermediate_size,
+      activation=tf_utils.get_activation(config.hidden_activation),
+      dropout_rate=config.dropout_rate,
+      attention_dropout_rate=config.attention_dropout_rate,
+      max_sequence_length=config.max_position_embeddings,
+      type_vocab_size=config.type_vocab_size,
+      initializer=tf.keras.initializers.TruncatedNormal(
+          stddev=config.initializer_range),
+      embedding_width=config.embedding_size,
+      embedding_layer=embedding_layer)
+  return encoder_network
diff --git a/official/nlp/data/classifier_data_lib.py b/official/nlp/data/classifier_data_lib.py
index ce17edc1f4d83eb1fa2fb305303412b77384ff9b..09f5863c19156ef601197acdc1ab0b10fe2d699c 100644
--- a/official/nlp/data/classifier_data_lib.py
+++ b/official/nlp/data/classifier_data_lib.py
@@ -31,9 +31,15 @@ from official.nlp.bert import tokenization
 
 
 class InputExample(object):
-  """A single training/test example for simple sequence classification."""
+  """A single training/test example for simple seq regression/classification."""
 
-  def __init__(self, guid, text_a, text_b=None, label=None, weight=None):
+  def __init__(self,
+               guid,
+               text_a,
+               text_b=None,
+               label=None,
+               weight=None,
+               int_iden=None):
     """Constructs a InputExample.
 
     Args:
@@ -42,16 +48,20 @@ class InputExample(object):
         sequence tasks, only this sequence must be specified.
       text_b: (Optional) string. The untokenized text of the second sequence.
         Only must be specified for sequence pair tasks.
-      label: (Optional) string. The label of the example. This should be
-        specified for train and dev examples, but not for test examples.
+      label: (Optional) string for classification, float for regression. The
+        label of the example. This should be specified for train and dev
+        examples, but not for test examples.
       weight: (Optional) float. The weight of the example to be used during
         training.
+      int_iden: (Optional) int. The integer identification number of the
+        example in the corpus.
     """
     self.guid = guid
     self.text_a = text_a
     self.text_b = text_b
     self.label = label
     self.weight = weight
+    self.int_iden = int_iden
 
 
 class InputFeatures(object):
@@ -63,20 +73,24 @@ class InputFeatures(object):
                segment_ids,
                label_id,
                is_real_example=True,
-               weight=None):
+               weight=None,
+               int_iden=None):
     self.input_ids = input_ids
     self.input_mask = input_mask
     self.segment_ids = segment_ids
     self.label_id = label_id
     self.is_real_example = is_real_example
     self.weight = weight
+    self.int_iden = int_iden
 
 
 class DataProcessor(object):
-  """Base class for data converters for sequence classification data sets."""
+  """Base class for converters for seq regression/classification datasets."""
 
   def __init__(self, process_text_fn=tokenization.convert_to_unicode):
    self.process_text_fn = process_text_fn
+    self.is_regression = False
+    self.label_type = None
 
   def get_train_examples(self, data_dir):
     """Gets a collection of `InputExample`s for the train set."""
@@ -110,92 +124,163 @@ class DataProcessor(object):
     return lines
 
 
-class XnliProcessor(DataProcessor):
-  """Processor for the XNLI data set."""
-  supported_languages = [
-      "ar", "bg", "de", "el", "en", "es", "fr", "hi", "ru", "sw", "th", "tr",
-      "ur", "vi", "zh"
-  ]
-
-  def __init__(self,
-               language="en",
-               process_text_fn=tokenization.convert_to_unicode):
-    super(XnliProcessor, self).__init__(process_text_fn)
-    if language == "all":
-      self.languages = XnliProcessor.supported_languages
-    elif language not in XnliProcessor.supported_languages:
-      raise ValueError("language %s is not supported for XNLI task." % language)
-    else:
-      self.languages = [language]
+class ColaProcessor(DataProcessor):
+  """Processor for the CoLA data set (GLUE version)."""
 
   def get_train_examples(self, data_dir):
     """See base class."""
-    lines = []
-    for language in self.languages:
-      # Skips the header.
-      lines.extend(
-          self._read_tsv(
-              os.path.join(data_dir, "multinli",
                           "multinli.train.%s.tsv" % language))[1:])
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "train.tsv")), "train")
+
+  def get_dev_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev")
+
+  def get_test_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "test.tsv")), "test")
+
+  def get_labels(self):
+    """See base class."""
+    return ["0", "1"]
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "COLA"
+
+  def _create_examples(self, lines, set_type):
+    """Creates examples for the training/dev/test sets."""
     examples = []
-    for (i, line) in enumerate(lines):
-      guid = "train-%d" % i
-      text_a = self.process_text_fn(line[0])
-      text_b = self.process_text_fn(line[1])
-      label = self.process_text_fn(line[2])
-      if label == self.process_text_fn("contradictory"):
-        label = self.process_text_fn("contradiction")
+    for i, line in enumerate(lines):
+      # Only the test set has a header.
+      if set_type == "test" and i == 0:
+        continue
+      guid = "%s-%s" % (set_type, i)
+      if set_type == "test":
+        text_a = self.process_text_fn(line[1])
+        label = "0"
+      else:
+        text_a = self.process_text_fn(line[3])
+        label = self.process_text_fn(line[1])
       examples.append(
-          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+          InputExample(guid=guid, text_a=text_a, text_b=None, label=label))
     return examples
 
+
+class MnliProcessor(DataProcessor):
+  """Processor for the MultiNLI data set (GLUE version)."""
+
+  def __init__(self,
+               mnli_type="matched",
+               process_text_fn=tokenization.convert_to_unicode):
+    super(MnliProcessor, self).__init__(process_text_fn)
+    if mnli_type not in ("matched", "mismatched"):
+      raise ValueError("Invalid `mnli_type`: %s" % mnli_type)
+    self.mnli_type = mnli_type
+
+  def get_train_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "train.tsv")), "train")
+
   def get_dev_examples(self, data_dir):
     """See base class."""
-    lines = self._read_tsv(os.path.join(data_dir, "xnli.dev.tsv"))
+    if self.mnli_type == "matched":
+      return self._create_examples(
+          self._read_tsv(os.path.join(data_dir, "dev_matched.tsv")),
+          "dev_matched")
+    else:
+      return self._create_examples(
+          self._read_tsv(os.path.join(data_dir, "dev_mismatched.tsv")),
+          "dev_mismatched")
+
+  def get_test_examples(self, data_dir):
+    """See base class."""
+    if self.mnli_type == "matched":
+      return self._create_examples(
+          self._read_tsv(os.path.join(data_dir, "test_matched.tsv")), "test")
+    else:
+      return self._create_examples(
+          self._read_tsv(os.path.join(data_dir, "test_mismatched.tsv")), "test")
+
+  def get_labels(self):
+    """See base class."""
+    return ["contradiction", "entailment", "neutral"]
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "MNLI"
+
+  def _create_examples(self, lines, set_type):
+    """Creates examples for the training/dev/test sets."""
     examples = []
-    for (i, line) in enumerate(lines):
+    for i, line in enumerate(lines):
       if i == 0:
         continue
-      guid = "dev-%d" % i
-      text_a = self.process_text_fn(line[6])
-      text_b = self.process_text_fn(line[7])
-      label = self.process_text_fn(line[1])
+      guid = "%s-%s" % (set_type, self.process_text_fn(line[0]))
+      text_a = self.process_text_fn(line[8])
+      text_b = self.process_text_fn(line[9])
+      if set_type == "test":
+        label = "contradiction"
+      else:
+        label = self.process_text_fn(line[-1])
       examples.append(
           InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
     return examples
 
+
+class MrpcProcessor(DataProcessor):
+  """Processor for the MRPC data set (GLUE version)."""
+
+  def get_train_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "train.tsv")), "train")
+
+  def get_dev_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev")
+
   def get_test_examples(self, data_dir):
     """See base class."""
-    lines = self._read_tsv(os.path.join(data_dir, "xnli.test.tsv"))
-    examples_by_lang = {k: [] for k in XnliProcessor.supported_languages}
-    for (i, line) in enumerate(lines):
-      if i == 0:
-        continue
-      guid = "test-%d" % i
-      language = self.process_text_fn(line[0])
-      text_a = self.process_text_fn(line[6])
-      text_b = self.process_text_fn(line[7])
-      label = self.process_text_fn(line[1])
-      examples_by_lang[language].append(
-          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
-    return examples_by_lang
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "test.tsv")), "test")
 
   def get_labels(self):
     """See base class."""
-    return ["contradiction", "entailment", "neutral"]
+    return ["0", "1"]
 
   @staticmethod
   def get_processor_name():
     """See base class."""
-    return "XNLI"
+    return "MRPC"
+
+  def _create_examples(self, lines, set_type):
+    """Creates examples for the training/dev/test sets."""
+    examples = []
+    for i, line in enumerate(lines):
+      if i == 0:
+        continue
+      guid = "%s-%s" % (set_type, i)
+      text_a = self.process_text_fn(line[3])
+      text_b = self.process_text_fn(line[4])
+      if set_type == "test":
+        label = "0"
+      else:
+        label = self.process_text_fn(line[0])
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
 
 
 class PawsxProcessor(DataProcessor):
   """Processor for the PAWS-X data set."""
-  supported_languages = [
-      "de", "en", "es", "fr", "ja", "ko", "zh"
-  ]
+  supported_languages = ["de", "en", "es", "fr", "ja", "ko", "zh"]
 
   def __init__(self,
               language="en",
@@ -219,11 +304,10 @@ class PawsxProcessor(DataProcessor):
         train_tsv = "translated_train.tsv"
       # Skips the header.
       lines.extend(
-          self._read_tsv(
-              os.path.join(data_dir, language, train_tsv))[1:])
+          self._read_tsv(os.path.join(data_dir, language, train_tsv))[1:])
 
     examples = []
-    for (i, line) in enumerate(lines):
+    for i, line in enumerate(lines):
      guid = "train-%d" % i
       text_a = self.process_text_fn(line[1])
       text_b = self.process_text_fn(line[2])
@@ -235,13 +319,12 @@ class PawsxProcessor(DataProcessor):
   def get_dev_examples(self, data_dir):
     """See base class."""
     lines = []
-    for language in PawsxProcessor.supported_languages:
-      # Skips the header.
+    for lang in PawsxProcessor.supported_languages:
       lines.extend(
-          self._read_tsv(os.path.join(data_dir, language, "dev_2k.tsv"))[1:])
+          self._read_tsv(os.path.join(data_dir, lang, "dev_2k.tsv"))[1:])
 
     examples = []
-    for (i, line) in enumerate(lines):
+    for i, line in enumerate(lines):
       guid = "dev-%d" % i
       text_a = self.process_text_fn(line[1])
       text_b = self.process_text_fn(line[2])
@@ -252,17 +335,15 @@ class PawsxProcessor(DataProcessor):
 
   def get_test_examples(self, data_dir):
     """See base class."""
-    examples_by_lang = {k: [] for k in PawsxProcessor.supported_languages}
-    for language in PawsxProcessor.supported_languages:
-      lines = self._read_tsv(os.path.join(data_dir, language, "test_2k.tsv"))
-      for (i, line) in enumerate(lines):
-        if i == 0:
-          continue
+    examples_by_lang = {k: [] for k in self.supported_languages}
+    for lang in self.supported_languages:
+      lines = self._read_tsv(os.path.join(data_dir, lang, "test_2k.tsv"))[1:]
+      for i, line in enumerate(lines):
        guid = "test-%d" % i
        text_a = self.process_text_fn(line[1])
        text_b = self.process_text_fn(line[2])
        label = self.process_text_fn(line[3])
-        examples_by_lang[language].append(
+        examples_by_lang[lang].append(
            InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
     return examples_by_lang
 
@@ -273,57 +354,11 @@
   @staticmethod
   def get_processor_name():
     """See base class."""
-    return "PAWS-X"
-
-
-class MnliProcessor(DataProcessor):
-  """Processor for the MultiNLI data set (GLUE version)."""
-
-  def get_train_examples(self, data_dir):
-    """See base class."""
-    return self._create_examples(
-        self._read_tsv(os.path.join(data_dir, "train.tsv")), "train")
+    return "XTREME-PAWS-X"
 
-  def get_dev_examples(self, data_dir):
-    """See base class."""
-    return self._create_examples(
-        self._read_tsv(os.path.join(data_dir, "dev_matched.tsv")),
-        "dev_matched")
 
-  def get_test_examples(self, data_dir):
-    """See base class."""
-    return self._create_examples(
-        self._read_tsv(os.path.join(data_dir, "test_matched.tsv")), "test")
-
-  def get_labels(self):
-    """See base class."""
-    return ["contradiction", "entailment", "neutral"]
-
-  @staticmethod
-  def get_processor_name():
-    """See base class."""
-    return "MNLI"
-
-  def _create_examples(self, lines, set_type):
-    """Creates examples for the training and dev sets."""
-    examples = []
-    for (i, line) in enumerate(lines):
-      if i == 0:
-        continue
-      guid = "%s-%s" % (set_type, self.process_text_fn(line[0]))
-      text_a = self.process_text_fn(line[8])
-      text_b = self.process_text_fn(line[9])
-      if set_type == "test":
-        label = "contradiction"
-      else:
-        label = self.process_text_fn(line[-1])
-      examples.append(
-          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
-    return examples
-
-
-class MrpcProcessor(DataProcessor):
-  """Processor for the MRPC data set (GLUE version)."""
+class QnliProcessor(DataProcessor):
+  """Processor for the QNLI data set (GLUE version)."""
 
   def get_train_examples(self, data_dir):
     """See base class."""
@@ -333,7 +368,7 @@ class MrpcProcessor(DataProcessor):
   def get_dev_examples(self, data_dir):
     """See base class."""
     return self._create_examples(
-        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev")
+        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev_matched")
 
   def get_test_examples(self, data_dir):
     """See base class."""
@@ -342,26 +377,28 @@ class MrpcProcessor(DataProcessor):
 
   def get_labels(self):
     """See base class."""
-    return ["0", "1"]
+    return ["entailment", "not_entailment"]
 
   @staticmethod
   def get_processor_name():
     """See base class."""
-    return "MRPC"
+    return "QNLI"
 
   def _create_examples(self, lines, set_type):
-    """Creates examples for the training and dev sets."""
+    """Creates examples for the training/dev/test sets."""
     examples = []
-    for (i, line) in enumerate(lines):
+    for i, line in enumerate(lines):
       if i == 0:
         continue
-      guid = "%s-%s" % (set_type, i)
-      text_a = self.process_text_fn(line[3])
-      text_b = self.process_text_fn(line[4])
+      guid = "%s-%s" % (set_type, i)
       if set_type == "test":
-        label = "0"
+        text_a = tokenization.convert_to_unicode(line[1])
+        text_b = tokenization.convert_to_unicode(line[2])
+        label = "entailment"
       else:
-        label = self.process_text_fn(line[0])
+        text_a = tokenization.convert_to_unicode(line[1])
+        text_b = tokenization.convert_to_unicode(line[2])
+        label = tokenization.convert_to_unicode(line[-1])
       examples.append(
           InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
     return examples
@@ -395,9 +432,9 @@ class QqpProcessor(DataProcessor):
     return "QQP"
 
   def _create_examples(self, lines, set_type):
-    """Creates examples for the training and dev sets."""
+    """Creates examples for the training/dev/test sets."""
    examples = []
-    for (i, line) in enumerate(lines):
+    for i, line in enumerate(lines):
       if i == 0:
         continue
       guid = "%s-%s" % (set_type, line[0])
@@ -407,13 +444,13 @@ class QqpProcessor(DataProcessor):
         label = line[5]
       except IndexError:
         continue
-      examples.append(InputExample(guid=guid, text_a=text_a, text_b=text_b,
-                                   label=label))
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
     return examples
 
 
-class ColaProcessor(DataProcessor):
-  """Processor for the CoLA data set (GLUE version)."""
+class RteProcessor(DataProcessor):
+  """Processor for the RTE data set (GLUE version)."""
 
   def get_train_examples(self, data_dir):
     """See base class."""
@@ -432,29 +469,30 @@ class ColaProcessor(DataProcessor):
 
   def get_labels(self):
     """See base class."""
-    return ["0", "1"]
+    # All datasets are converted to 2-class split, where for 3-class datasets we
+    # collapse neutral and contradiction into not_entailment.
+    return ["entailment", "not_entailment"]
 
   @staticmethod
   def get_processor_name():
     """See base class."""
-    return "COLA"
+    return "RTE"
 
   def _create_examples(self, lines, set_type):
-    """Creates examples for the training and dev sets."""
+    """Creates examples for the training/dev/test sets."""
     examples = []
-    for (i, line) in enumerate(lines):
-      # Only the test set has a header
-      if set_type == "test" and i == 0:
+    for i, line in enumerate(lines):
+      if i == 0:
         continue
       guid = "%s-%s" % (set_type, i)
+      text_a = tokenization.convert_to_unicode(line[1])
+      text_b = tokenization.convert_to_unicode(line[2])
       if set_type == "test":
-        text_a = self.process_text_fn(line[1])
-        label = "0"
+        label = "entailment"
       else:
-        text_a = self.process_text_fn(line[3])
-        label = self.process_text_fn(line[1])
+        label = tokenization.convert_to_unicode(line[3])
       examples.append(
-          InputExample(guid=guid, text_a=text_a, text_b=None, label=label))
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
     return examples
 
@@ -486,9 +524,9 @@ class SstProcessor(DataProcessor):
     return "SST-2"
 
   def _create_examples(self, lines, set_type):
-    """Creates examples for the training and dev sets."""
+    """Creates examples for the training/dev/test sets."""
     examples = []
-    for (i, line) in enumerate(lines):
+    for i, line in enumerate(lines):
       if i == 0:
         continue
       guid = "%s-%s" % (set_type, i)
@@ -503,8 +541,14 @@ class SstProcessor(DataProcessor):
     return examples
 
 
-class QnliProcessor(DataProcessor):
-  """Processor for the QNLI data set (GLUE version)."""
+class StsBProcessor(DataProcessor):
+  """Processor for the STS-B data set (GLUE version)."""
+
+  def __init__(self, process_text_fn=tokenization.convert_to_unicode):
+    super(StsBProcessor, self).__init__(process_text_fn=process_text_fn)
+    self.is_regression = True
+    self.label_type = float
+    self._labels = None
 
   def get_train_examples(self, data_dir):
     """See base class."""
@@ -514,7 +558,7 @@ def get_dev_examples(self, data_dir):
     """See base class."""
     return self._create_examples(
-        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev_matched")
+        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev")
 
   def get_test_examples(self, data_dir):
     """See base class."""
@@ -523,28 +567,26 @@ def get_labels(self):
     """See base class."""
-    return ["entailment", "not_entailment"]
+    return self._labels
 
   @staticmethod
   def get_processor_name():
     """See base class."""
-    return "QNLI"
+    return "STS-B"
 
   def _create_examples(self, lines, set_type):
-    """Creates examples for the training and dev sets."""
+    """Creates examples for the training/dev/test sets."""
     examples = []
-    for (i, line) in enumerate(lines):
+    for i, line in enumerate(lines):
       if i == 0:
         continue
-      guid = "%s-%s" % (set_type, 1)
+      guid = "%s-%s" % (set_type, i)
+      text_a = tokenization.convert_to_unicode(line[7])
+      text_b = tokenization.convert_to_unicode(line[8])
       if set_type == "test":
-        text_a = tokenization.convert_to_unicode(line[1])
-        text_b = tokenization.convert_to_unicode(line[2])
-        label = "entailment"
+        label = 0.0
       else:
-        text_a = tokenization.convert_to_unicode(line[1])
-        text_b = tokenization.convert_to_unicode(line[2])
-        label = tokenization.convert_to_unicode(line[-1])
+        label = self.label_type(tokenization.convert_to_unicode(line[9]))
       examples.append(
           InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
     return examples
@@ -564,6 +606,8 @@ class TfdsProcessor(DataProcessor):
     tfds_params="dataset=glue/mrpc,text_key=sentence1,text_b_key=sentence2"
     tfds_params="dataset=glue/stsb,text_key=sentence1,text_b_key=sentence2,"
                 "is_regression=true,label_type=float"
+    tfds_params="dataset=snli,text_key=premise,text_b_key=hypothesis,"
+                "skip_label=-1"
   Possible parameters (please refer to the documentation of Tensorflow
   Datasets (TFDS) for the meaning of individual parameters):
     dataset: Required dataset name (potentially with subset and version number).
@@ -581,17 +625,19 @@ class TfdsProcessor(DataProcessor):
     label_type: Type of the label key (defaults to `int`).
     weight_key: Key of the float sample weight (is not used if not provided).
     is_regression: Whether the task is a regression problem (defaults to False).
+    skip_label: Skip examples with given label (defaults to None).
   """
 
-  def __init__(self, tfds_params,
+  def __init__(self,
+               tfds_params,
                process_text_fn=tokenization.convert_to_unicode):
     super(TfdsProcessor, self).__init__(process_text_fn)
     self._process_tfds_params_str(tfds_params)
     if self.module_import:
       importlib.import_module(self.module_import)
 
-    self.dataset, info = tfds.load(self.dataset_name, data_dir=self.data_dir,
-                                   with_info=True)
+    self.dataset, info = tfds.load(
+        self.dataset_name, data_dir=self.data_dir, with_info=True)
     if self.is_regression:
       self._labels = None
     else:
@@ -619,6 +665,9 @@ class TfdsProcessor(DataProcessor):
     self.label_type = dtype_map[d.get("label_type", "int")]
     self.is_regression = cast_str_to_bool(d.get("is_regression", "False"))
     self.weight_key = d.get("weight_key", None)
+    self.skip_label = d.get("skip_label", None)
+    if self.skip_label is not None:
+      self.skip_label = self.label_type(self.skip_label)
 
   def get_train_examples(self, data_dir):
     assert data_dir is None
@@ -639,7 +688,7 @@
     return "TFDS_" + self.dataset_name
 
   def _create_examples(self, split_name, set_type):
-    """Creates examples for the training and dev sets."""
+    """Creates examples for the training/dev/test sets."""
     if split_name not in self.dataset:
       raise ValueError("Split {} not available.".format(split_name))
     dataset = self.dataset[split_name].as_numpy_iterator()
@@ -657,13 +706,258 @@
       if self.text_b_key:
         text_b = self.process_text_fn(example[self.text_b_key])
       label = self.label_type(example[self.label_key])
+      if self.skip_label is not None and label == self.skip_label:
+        continue
       if self.weight_key:
         weight = float(example[self.weight_key])
       examples.append(
-          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label,
-                       weight=weight))
+          InputExample(
+              guid=guid,
+              text_a=text_a,
+              text_b=text_b,
+              label=label,
+              weight=weight))
+    return examples
+
+
+class WnliProcessor(DataProcessor):
+  """Processor for the WNLI data set (GLUE version)."""
+
+  def get_train_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "train.tsv")), "train")
+
+  def get_dev_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev")
+
+  def get_test_examples(self, data_dir):
+    """See base class."""
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, "test.tsv")), "test")
+
+  def get_labels(self):
+    """See base class."""
+    return ["0", "1"]
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "WNLI"
+
+  def _create_examples(self, lines, set_type):
+    """Creates examples for the training/dev/test sets."""
+    examples = []
+    for i, line in enumerate(lines):
+      if i == 0:
+        continue
+      guid = "%s-%s" % (set_type, i)
+      text_a = tokenization.convert_to_unicode(line[1])
+      text_b = tokenization.convert_to_unicode(line[2])
+      if set_type == "test":
+        label = "0"
+      else:
+        label = tokenization.convert_to_unicode(line[3])
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
+
+
+class XnliProcessor(DataProcessor):
+  """Processor for the XNLI data set."""
+  supported_languages = [
+      "ar", "bg", "de", "el", "en", "es", "fr", "hi", "ru", "sw", "th", "tr",
+      "ur", "vi", "zh"
+  ]
+
+  def __init__(self,
+               language="en",
+               process_text_fn=tokenization.convert_to_unicode):
+    super(XnliProcessor, self).__init__(process_text_fn)
+    if language == "all":
+      self.languages = XnliProcessor.supported_languages
+    elif language not in XnliProcessor.supported_languages:
+      raise ValueError("language %s is not supported for XNLI task." % language)
+    else:
+      self.languages = [language]
+
+  def get_train_examples(self, data_dir):
+    """See base class."""
+    lines = []
+    for language in self.languages:
+      # Skips the header.
+      lines.extend(
+          self._read_tsv(
+              os.path.join(data_dir, "multinli",
+                           "multinli.train.%s.tsv" % language))[1:])
+
+    examples = []
+    for i, line in enumerate(lines):
+      guid = "train-%d" % i
+      text_a = self.process_text_fn(line[0])
+      text_b = self.process_text_fn(line[1])
+      label = self.process_text_fn(line[2])
+      if label == self.process_text_fn("contradictory"):
+        label = self.process_text_fn("contradiction")
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
+
+  def get_dev_examples(self, data_dir):
+    """See base class."""
+    lines = self._read_tsv(os.path.join(data_dir, "xnli.dev.tsv"))
+    examples = []
+    for i, line in enumerate(lines):
+      if i == 0:
+        continue
+      guid = "dev-%d" % i
+      text_a = self.process_text_fn(line[6])
+      text_b = self.process_text_fn(line[7])
+      label = self.process_text_fn(line[1])
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
+
+  def get_test_examples(self, data_dir):
+    """See base class."""
+    lines = self._read_tsv(os.path.join(data_dir, "xnli.test.tsv"))
+    examples_by_lang = {k: [] for k in XnliProcessor.supported_languages}
+    for i, line in enumerate(lines):
+      if i == 0:
+        continue
+      guid = "test-%d" % i
+      language = self.process_text_fn(line[0])
+      text_a = self.process_text_fn(line[6])
+      text_b = self.process_text_fn(line[7])
+      label = self.process_text_fn(line[1])
+      examples_by_lang[language].append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples_by_lang
+
+  def get_labels(self):
+    """See base class."""
+    return ["contradiction", "entailment", "neutral"]
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "XNLI"
+
+
+class XtremePawsxProcessor(DataProcessor):
+  """Processor for the XTREME PAWS-X data set."""
+  supported_languages = ["de", "en", "es", "fr", "ja", "ko", "zh"]
+
+  def get_train_examples(self, data_dir):
+    """See base class."""
+    lines = self._read_tsv(os.path.join(data_dir, "train-en.tsv"))
+    examples = []
+    for i, line in enumerate(lines):
+      guid = "train-%d" % i
+      text_a = self.process_text_fn(line[0])
+      text_b = self.process_text_fn(line[1])
+      label = self.process_text_fn(line[2])
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
+
+  def get_dev_examples(self, data_dir):
+    """See base class."""
+    lines = self._read_tsv(os.path.join(data_dir, "dev-en.tsv"))
+
+    examples = []
+    for i, line in enumerate(lines):
+      guid = "dev-%d" % i
+      text_a = self.process_text_fn(line[0])
+      text_b = self.process_text_fn(line[1])
+      label = self.process_text_fn(line[2])
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
+
+  def get_test_examples(self, data_dir):
+    """See base class."""
+    examples_by_lang = {k: [] for k in self.supported_languages}
+    for lang in self.supported_languages:
+      lines = self._read_tsv(os.path.join(data_dir, f"test-{lang}.tsv"))
+      for i, line in enumerate(lines):
+        guid = "test-%d" % i
+        text_a = self.process_text_fn(line[0])
+        text_b = self.process_text_fn(line[1])
+        label = "0"
+        examples_by_lang[lang].append(
+            InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples_by_lang
+
+  def get_labels(self):
+    """See base class."""
+    return ["0", "1"]
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "XTREME-PAWS-X"
+
+
+class XtremeXnliProcessor(DataProcessor):
+  """Processor for the XTREME XNLI data set."""
+  supported_languages = [
+      "ar", "bg", "de", "el", "en", "es", "fr", "hi", "ru", "sw", "th", "tr",
+      "ur", "vi", "zh"
+  ]
+
+  def get_train_examples(self, data_dir):
+    """See base class."""
+    lines = self._read_tsv(os.path.join(data_dir, "train-en.tsv"))
+
+    examples = []
+    for i, line in enumerate(lines):
+      guid = "train-%d" % i
+      text_a = self.process_text_fn(line[0])
+      text_b = self.process_text_fn(line[1])
+      label = self.process_text_fn(line[2])
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
+
+  def get_dev_examples(self, data_dir):
+    """See base class."""
+    lines = self._read_tsv(os.path.join(data_dir, "dev-en.tsv"))
+    examples = []
+    for i, line in enumerate(lines):
+      guid = "dev-%d" % i
+      text_a = self.process_text_fn(line[0])
+      text_b = self.process_text_fn(line[1])
+      label = self.process_text_fn(line[2])
+      examples.append(
+          InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples
+
+  def get_test_examples(self, data_dir):
+    """See base class."""
+    examples_by_lang = {k: [] for k in self.supported_languages}
+    for lang in self.supported_languages:
+      lines = self._read_tsv(os.path.join(data_dir, f"test-{lang}.tsv"))
+      for i, line in enumerate(lines):
+        guid = f"test-{i}"
+        text_a = self.process_text_fn(line[0])
+        text_b = self.process_text_fn(line[1])
+        label = "contradiction"
+        examples_by_lang[lang].append(
+            InputExample(guid=guid, text_a=text_a, text_b=text_b, label=label))
+    return examples_by_lang
+
+  def get_labels(self):
+    """See base class."""
+    return ["contradiction", "entailment", "neutral"]
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "XTREME-XNLI"
+
 
 def convert_single_example(ex_index, example, label_list, max_seq_length,
                            tokenizer):
@@ -748,8 +1042,9 @@ def convert_single_example(ex_index, example, label_list, max_seq_length,
     logging.info("input_ids: %s", " ".join([str(x) for x in input_ids]))
     logging.info("input_mask: %s", " ".join([str(x) for x in input_mask]))
     logging.info("segment_ids: %s", " ".join([str(x) for x in segment_ids]))
-    logging.info("label: %s (id = %d)", example.label, label_id)
+    logging.info("label: %s (id = %s)", example.label, str(label_id))
     logging.info("weight: %s", example.weight)
+    logging.info("int_iden: %s", str(example.int_iden))
 
   feature = InputFeatures(
@@ -757,19 +1052,24 @@
       segment_ids=segment_ids,
       label_id=label_id,
       is_real_example=True,
-      weight=example.weight)
+      weight=example.weight,
+      int_iden=example.int_iden)
+
   return feature
 
 
-def file_based_convert_examples_to_features(examples, label_list,
-                                            max_seq_length, tokenizer,
-                                            output_file, label_type=None):
+def file_based_convert_examples_to_features(examples,
+                                            label_list,
+                                            max_seq_length,
+                                            tokenizer,
+                                            output_file,
+                                            label_type=None):
   """Convert a set of `InputExample`s to a TFRecord file."""
 
   tf.io.gfile.makedirs(os.path.dirname(output_file))
   writer = tf.io.TFRecordWriter(output_file)
 
-  for (ex_index, example) in enumerate(examples):
+  for ex_index, example in enumerate(examples):
     if ex_index % 10000 == 0:
       logging.info("Writing example %d of %d", ex_index, len(examples))
 
@@ -779,6 +1079,7 @@
     def create_int_feature(values):
       f = tf.train.Feature(int64_list=tf.train.Int64List(value=list(values)))
       return f
+
     def create_float_feature(values):
       f = tf.train.Feature(float_list=tf.train.FloatList(value=list(values)))
       return f
@@ -789,12 +1090,14 @@
     features["segment_ids"] = create_int_feature(feature.segment_ids)
     if label_type is not None and label_type == float:
      features["label_ids"] = create_float_feature([feature.label_id])
-    else:
+    elif feature.label_id is not None:
      features["label_ids"] = create_int_feature([feature.label_id])
     features["is_real_example"] = create_int_feature(
         [int(feature.is_real_example)])
     if feature.weight is not None:
       features["weight"] = create_float_feature([feature.weight])
+    if feature.int_iden is not None:
+      features["int_iden"] = create_int_feature([feature.int_iden])
 
     tf_example = tf.train.Example(features=tf.train.Features(feature=features))
     writer.write(tf_example.SerializeToString())
@@ -830,8 +1133,7 @@
   Arguments:
     processor: Input processor object to be used for generating data. Subclass
       of `DataProcessor`.
-    data_dir: Directory that contains train/eval data to process. Data files
-      should be in from "dev.tsv", "test.tsv", or "train.tsv".
+    data_dir: Directory that contains train/eval/test data to process.
     tokenizer: The tokenizer to be applied on the data.
     train_data_output_path: Output to which processed tf record for training
       will be saved.
@@ -857,8 +1159,7 @@
   train_input_data_examples = processor.get_train_examples(data_dir)
   file_based_convert_examples_to_features(train_input_data_examples,
                                           label_list, max_seq_length, tokenizer,
-                                          train_data_output_path,
-                                          label_type)
+                                          train_data_output_path, label_type)
   num_training_data = len(train_input_data_examples)
 
   if eval_data_output_path:
@@ -868,26 +1169,27 @@
                                             tokenizer, eval_data_output_path,
                                             label_type)
 
+  meta_data = {
+      "processor_type": processor.get_processor_name(),
+      "train_data_size": num_training_data,
+      "max_seq_length": max_seq_length,
+  }
+
   if test_data_output_path:
     test_input_data_examples = processor.get_test_examples(data_dir)
     if isinstance(test_input_data_examples, dict):
       for language, examples in test_input_data_examples.items():
         file_based_convert_examples_to_features(
-            examples,
-            label_list, max_seq_length,
-            tokenizer, test_data_output_path.format(language),
-            label_type)
+            examples, label_list, max_seq_length, tokenizer,
+            test_data_output_path.format(language), label_type)
+        meta_data["test_{}_data_size".format(language)] = len(examples)
    else:
       file_based_convert_examples_to_features(test_input_data_examples,
                                               label_list, max_seq_length,
                                               tokenizer,
test_data_output_path, label_type) + meta_data["test_data_size"] = len(test_input_data_examples) - meta_data = { - "processor_type": processor.get_processor_name(), - "train_data_size": num_training_data, - "max_seq_length": max_seq_length, - } if is_regression: meta_data["task_type"] = "bert_regression" meta_data["label_type"] = {int: "int", float: "float"}[label_type] @@ -900,12 +1202,4 @@ def generate_tf_record_from_data_file(processor, if eval_data_output_path: meta_data["eval_data_size"] = len(eval_input_data_examples) - if test_data_output_path: - test_input_data_examples = processor.get_test_examples(data_dir) - if isinstance(test_input_data_examples, dict): - for language, examples in test_input_data_examples.items(): - meta_data["test_{}_data_size".format(language)] = len(examples) - else: - meta_data["test_data_size"] = len(test_input_data_examples) - return meta_data diff --git a/official/nlp/data/create_finetuning_data.py b/official/nlp/data/create_finetuning_data.py index 256c1dee0adad8b4e35a58212e62573edd946b6b..403d66b41c5b728cb3da5e3d31eeea535defbc91 100644 --- a/official/nlp/data/create_finetuning_data.py +++ b/official/nlp/data/create_finetuning_data.py @@ -27,18 +27,21 @@ from absl import flags import tensorflow as tf from official.nlp.bert import tokenization from official.nlp.data import classifier_data_lib +from official.nlp.data import sentence_retrieval_lib # word-piece tokenizer based squad_lib from official.nlp.data import squad_lib as squad_lib_wp # sentence-piece tokenizer based squad_lib from official.nlp.data import squad_lib_sp +from official.nlp.data import tagging_data_lib FLAGS = flags.FLAGS +# TODO(chendouble): consider moving each task to its own binary. 
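The processors consumed by `generate_tf_record_from_data_file` above all share one contract: each `get_*_examples` method reads a split-specific TSV and wraps its rows in `InputExample`s (multilingual test sets return a per-language dict). A minimal, framework-free sketch of that contract; `TinyExample` and `TinyXnliProcessor` are illustrative names, not classes from this library:

```python
import csv
import io


class TinyExample:
  """Illustrative stand-in for classifier_data_lib.InputExample."""

  def __init__(self, guid, text_a, text_b=None, label=None):
    self.guid = guid
    self.text_a = text_a
    self.text_b = text_b
    self.label = label


class TinyXnliProcessor:
  """Sketch of the get_*_examples contract shown in the diff above."""

  def _read_tsv(self, fileobj):
    return list(csv.reader(fileobj, delimiter="\t"))

  def get_train_examples(self, fileobj):
    # Same shape as the real processors: guid, text_a, text_b, label per row.
    return [
        TinyExample("train-%d" % i, line[0], line[1], line[2])
        for i, line in enumerate(self._read_tsv(fileobj))
    ]


tsv = io.StringIO("premise\thypothesis\tentailment\n")
examples = TinyXnliProcessor().get_train_examples(tsv)
print(examples[0].guid, examples[0].label)  # train-0 entailment
```

The real processors additionally run `process_text_fn` over every field and read from `data_dir`; this sketch keeps only the guid/text/label plumbing.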
flags.DEFINE_enum( "fine_tuning_task_type", "classification", - ["classification", "regression", "squad"], + ["classification", "regression", "squad", "retrieval", "tagging"], "The name of the BERT fine tuning task for which data " - "will be generated..") + "will be generated.") # BERT classification specific flags. flags.DEFINE_string( @@ -47,23 +50,41 @@ flags.DEFINE_string( "for the task.") flags.DEFINE_enum("classification_task_name", "MNLI", - ["COLA", "MNLI", "MRPC", "QNLI", "QQP", "SST-2", "XNLI", - "PAWS-X"], - "The name of the task to train BERT classifier.") + ["COLA", "MNLI", "MRPC", "PAWS-X", "QNLI", "QQP", "RTE", + "SST-2", "STS-B", "WNLI", "XNLI", "XTREME-XNLI", + "XTREME-PAWS-X"], + "The name of the task to train BERT classifier. The " + "difference between XTREME-XNLI and XNLI is: 1. the format " + "of input tsv files; 2. the dev set for XTREME is english " + "only and for XNLI is all languages combined. Same for " + "PAWS-X.") + +# MNLI task-specific flag. +flags.DEFINE_enum( + "mnli_type", "matched", ["matched", "mismatched"], + "The type of MNLI dataset.") -# XNLI task specific flag. +# XNLI task-specific flag. flags.DEFINE_string( "xnli_language", "en", - "Language of training data for XNIL task. If the value is 'all', the data " + "Language of training data for XNLI task. If the value is 'all', the data " "of all languages will be used for training.") -# PAWS-X task specific flag. +# PAWS-X task-specific flag. flags.DEFINE_string( "pawsx_language", "en", - "Language of trainig data for PAWS-X task. If the value is 'all', the data " + "Language of training data for PAWS-X task. If the value is 'all', the data " "of all languages will be used for training.") -# BERT Squad task specific flags. +# Retrieval task-specific flags. +flags.DEFINE_enum("retrieval_task_name", "bucc", ["bucc", "tatoeba"], + "The name of sentence retrieval task for scoring") + +# Tagging task-specific flags. 
+flags.DEFINE_enum("tagging_task_name", "panx", ["panx", "udpos"], + "The name of BERT tagging (token classification) task.") + +# BERT Squad task-specific flags. flags.DEFINE_string( "squad_data_file", None, "The input data file in for generating training data for BERT squad task.") @@ -163,20 +184,29 @@ def generate_classifier_dataset(): "cola": classifier_data_lib.ColaProcessor, "mnli": - classifier_data_lib.MnliProcessor, + functools.partial(classifier_data_lib.MnliProcessor, + mnli_type=FLAGS.mnli_type), "mrpc": classifier_data_lib.MrpcProcessor, "qnli": classifier_data_lib.QnliProcessor, "qqp": classifier_data_lib.QqpProcessor, + "rte": classifier_data_lib.RteProcessor, "sst-2": classifier_data_lib.SstProcessor, + "sts-b": + classifier_data_lib.StsBProcessor, "xnli": functools.partial(classifier_data_lib.XnliProcessor, language=FLAGS.xnli_language), "paws-x": functools.partial(classifier_data_lib.PawsxProcessor, - language=FLAGS.pawsx_language) + language=FLAGS.pawsx_language), + "wnli": classifier_data_lib.WnliProcessor, + "xtreme-xnli": + functools.partial(classifier_data_lib.XtremeXnliProcessor), + "xtreme-paws-x": + functools.partial(classifier_data_lib.XtremePawsxProcessor) } task_name = FLAGS.classification_task_name.lower() if task_name not in processors: @@ -237,6 +267,67 @@ def generate_squad_dataset(): FLAGS.max_query_length, FLAGS.doc_stride, FLAGS.version_2_with_negative) +def generate_retrieval_dataset(): + """Generate retrieval test and dev dataset and returns input meta data.""" + assert (FLAGS.input_data_dir and FLAGS.retrieval_task_name) + if FLAGS.tokenizer_impl == "word_piece": + tokenizer = tokenization.FullTokenizer( + vocab_file=FLAGS.vocab_file, do_lower_case=FLAGS.do_lower_case) + processor_text_fn = tokenization.convert_to_unicode + else: + assert FLAGS.tokenizer_impl == "sentence_piece" + tokenizer = tokenization.FullSentencePieceTokenizer(FLAGS.sp_model_file) + processor_text_fn = functools.partial( + tokenization.preprocess_text, 
lower=FLAGS.do_lower_case) + + processors = { + "bucc": sentence_retrieval_lib.BuccProcessor, + "tatoeba": sentence_retrieval_lib.TatoebaProcessor, + } + + task_name = FLAGS.retrieval_task_name.lower() + if task_name not in processors: + raise ValueError("Task not found: %s" % task_name) + + processor = processors[task_name](process_text_fn=processor_text_fn) + + return sentence_retrieval_lib.generate_sentence_retrevial_tf_record( + processor, + FLAGS.input_data_dir, + tokenizer, + FLAGS.eval_data_output_path, + FLAGS.test_data_output_path, + FLAGS.max_seq_length) + + +def generate_tagging_dataset(): + """Generates tagging dataset.""" + processors = { + "panx": tagging_data_lib.PanxProcessor, + "udpos": tagging_data_lib.UdposProcessor, + } + task_name = FLAGS.tagging_task_name.lower() + if task_name not in processors: + raise ValueError("Task not found: %s" % task_name) + + if FLAGS.tokenizer_impl == "word_piece": + tokenizer = tokenization.FullTokenizer( + vocab_file=FLAGS.vocab_file, do_lower_case=FLAGS.do_lower_case) + processor_text_fn = tokenization.convert_to_unicode + elif FLAGS.tokenizer_impl == "sentence_piece": + tokenizer = tokenization.FullSentencePieceTokenizer(FLAGS.sp_model_file) + processor_text_fn = functools.partial( + tokenization.preprocess_text, lower=FLAGS.do_lower_case) + else: + raise ValueError("Unsupported tokenizer_impl: %s" % FLAGS.tokenizer_impl) + + processor = processors[task_name]() + return tagging_data_lib.generate_tf_record_from_data_file( + processor, FLAGS.input_data_dir, tokenizer, FLAGS.max_seq_length, + FLAGS.train_data_output_path, FLAGS.eval_data_output_path, + FLAGS.test_data_output_path, processor_text_fn) + + def main(_): if FLAGS.tokenizer_impl == "word_piece": if not FLAGS.vocab_file: @@ -248,12 +339,20 @@ def main(_): raise ValueError( "FLAG sp_model_file for sentence-piece tokenizer is not specified.") + if FLAGS.fine_tuning_task_type != "retrieval": + flags.mark_flag_as_required("train_data_output_path") + if 
FLAGS.fine_tuning_task_type == "classification": input_meta_data = generate_classifier_dataset() elif FLAGS.fine_tuning_task_type == "regression": input_meta_data = generate_regression_dataset() - else: + elif FLAGS.fine_tuning_task_type == "retrieval": + input_meta_data = generate_retrieval_dataset() + elif FLAGS.fine_tuning_task_type == "squad": input_meta_data = generate_squad_dataset() + else: + assert FLAGS.fine_tuning_task_type == "tagging" + input_meta_data = generate_tagging_dataset() tf.io.gfile.makedirs(os.path.dirname(FLAGS.meta_data_file_path)) with tf.io.gfile.GFile(FLAGS.meta_data_file_path, "w") as writer: @@ -261,6 +360,5 @@ def main(_): if __name__ == "__main__": - flags.mark_flag_as_required("train_data_output_path") flags.mark_flag_as_required("meta_data_file_path") app.run(main) diff --git a/official/nlp/data/create_pretraining_data.py b/official/nlp/data/create_pretraining_data.py index 79dac57ac8775687673604af6fb2fb50c9f74244..fff6391cee95d209be8f785fd43dd73184a65d11 100644 --- a/official/nlp/data/create_pretraining_data.py +++ b/official/nlp/data/create_pretraining_data.py @@ -18,6 +18,7 @@ from __future__ import division from __future__ import print_function import collections +import itertools import random from absl import app @@ -48,6 +49,12 @@ flags.DEFINE_bool( "do_whole_word_mask", False, "Whether to use whole word masking rather than per-WordPiece masking.") +flags.DEFINE_integer( + "max_ngram_size", None, + "Mask contiguous whole words (n-grams) of up to `max_ngram_size` using a " + "weighting scheme to favor shorter n-grams. 
" + "Note: `--do_whole_word_mask=True` must also be set when n-gram masking.") + flags.DEFINE_bool( "gzip_compress", False, "Whether to use `GZIP` compress option to get compressed TFRecord files.") @@ -192,7 +199,8 @@ def create_training_instances(input_files, masked_lm_prob, max_predictions_per_seq, rng, - do_whole_word_mask=False): + do_whole_word_mask=False, + max_ngram_size=None): """Create `TrainingInstance`s from raw text.""" all_documents = [[]] @@ -229,7 +237,7 @@ def create_training_instances(input_files, create_instances_from_document( all_documents, document_index, max_seq_length, short_seq_prob, masked_lm_prob, max_predictions_per_seq, vocab_words, rng, - do_whole_word_mask)) + do_whole_word_mask, max_ngram_size)) rng.shuffle(instances) return instances @@ -238,7 +246,8 @@ def create_training_instances(input_files, def create_instances_from_document( all_documents, document_index, max_seq_length, short_seq_prob, masked_lm_prob, max_predictions_per_seq, vocab_words, rng, - do_whole_word_mask=False): + do_whole_word_mask=False, + max_ngram_size=None): """Creates `TrainingInstance`s for a single document.""" document = all_documents[document_index] @@ -337,7 +346,7 @@ def create_instances_from_document( (tokens, masked_lm_positions, masked_lm_labels) = create_masked_lm_predictions( tokens, masked_lm_prob, max_predictions_per_seq, vocab_words, rng, - do_whole_word_mask) + do_whole_word_mask, max_ngram_size) instance = TrainingInstance( tokens=tokens, segment_ids=segment_ids, @@ -355,72 +364,238 @@ def create_instances_from_document( MaskedLmInstance = collections.namedtuple("MaskedLmInstance", ["index", "label"]) +# A _Gram is a [half-open) interval of token indices which form a word. +# E.g., +# words: ["The", "doghouse"] +# tokens: ["The", "dog", "##house"] +# grams: [(0,1), (1,3)] +_Gram = collections.namedtuple("_Gram", ["begin", "end"]) + + +def _window(iterable, size): + """Helper to create a sliding window iterator with a given size. 
+
+  E.g.,
+    input = [1, 2, 3, 4]
+    _window(input, 1) => [1], [2], [3], [4]
+    _window(input, 2) => [1, 2], [2, 3], [3, 4]
+    _window(input, 3) => [1, 2, 3], [2, 3, 4]
+    _window(input, 4) => [1, 2, 3, 4]
+    _window(input, 5) => (nothing yielded: input is shorter than the window)
+
+  Arguments:
+    iterable: elements to iterate over.
+    size: size of the window.
+
+  Yields:
+    Elements of `iterable` batched into a sliding window of length `size`.
+  """
+  i = iter(iterable)
+  window = []
+  try:
+    for e in range(0, size):
+      window.append(next(i))
+    yield window
+  except StopIteration:
+    # Handle the case where iterable's length is less than the window size.
+    return
+  for e in i:
+    window = window[1:] + [e]
+    yield window
+
+
+def _contiguous(sorted_grams):
+  """Test whether a sequence of grams is contiguous.
+
+  Arguments:
+    sorted_grams: _Grams which are sorted in increasing order.
+
+  Returns:
+    True if `sorted_grams` are touching each other.
+
+  E.g.,
+    _contiguous([(1, 4), (4, 5), (5, 10)]) == True
+    _contiguous([(1, 2), (4, 5)]) == False
+  """
+  for a, b in _window(sorted_grams, 2):
+    if a.end != b.begin:
+      return False
+  return True
+
+
+def _masking_ngrams(grams, max_ngram_size, max_masked_tokens, rng):
+  """Create a list of masking {1, ..., n}-grams from a list of one-grams.
+
+  This is an extension of 'whole word masking' to mask multiple, contiguous
+  words (e.g., "the red boat").
+
+  Each input gram represents the token indices of a single word,
+    words:  ["the", "red", "boat"]
+    tokens: ["the", "red", "boa", "##t"]
+    grams:  [(0,1), (1,2), (2,4)]
+
+  For a `max_ngram_size` of three, possible output masks include:
+    1-grams: (0,1), (1,2), (2,4)
+    2-grams: (0,2), (1,4)
+    3-grams: (0,4)
+
+  Output masks will not overlap and contain no more than `max_masked_tokens`
+  total tokens. E.g., for the example above with `max_masked_tokens` as three,
+  valid outputs are:
+    [(0,1), (1,2)]  # "the", "red" covering two tokens
+    [(1,2), (2,4)]  # "red", "boa", "##t" covering three tokens
+
+  The length of the selected n-gram follows a Zipf weighting to
+  favor shorter n-gram sizes (weight(1)=1, weight(2)=1/2, weight(3)=1/3, ...).
+
+  Arguments:
+    grams: List of one-grams.
+    max_ngram_size: Maximum number of contiguous one-grams combined to create
+      an n-gram.
+    max_masked_tokens: Maximum total number of tokens to be masked.
+    rng: `random.Random` generator.
+
+  Returns:
+    A list of n-grams to be used as masks.
+  """
+  if not grams:
+    return None
+
+  grams = sorted(grams)
+  num_tokens = grams[-1].end
+
+  # Ensure our grams are valid (i.e., they don't overlap).
+  for a, b in _window(grams, 2):
+    if a.end > b.begin:
+      raise ValueError("overlapping grams: {}".format(grams))
+
+  # Build map from n-gram length to list of n-grams.
+  ngrams = {i: [] for i in range(1, max_ngram_size+1)}
+  for gram_size in range(1, max_ngram_size+1):
+    for g in _window(grams, gram_size):
+      if _contiguous(g):
+        # Add an n-gram which spans these one-grams.
+        ngrams[gram_size].append(_Gram(g[0].begin, g[-1].end))
+
+  # Shuffle each list of n-grams.
+  for v in ngrams.values():
+    rng.shuffle(v)
+
+  # Create the weighting for n-gram length selection.
+  # Stored cumulatively for `random.choices` below.
+  cummulative_weights = list(
+      itertools.accumulate([1./n for n in range(1, max_ngram_size+1)]))
+
+  output_ngrams = []
+  # Keep a bitmask of which tokens have been masked.
+  masked_tokens = [False] * num_tokens
+  # Loop until we have enough masked tokens or there are no more candidate
+  # n-grams of any length.
+  # Each code path should ensure one or more elements from `ngrams` are removed
+  # to guarantee this loop terminates.
+  while (sum(masked_tokens) < max_masked_tokens and
+         sum(len(s) for s in ngrams.values())):
+    # Pick an n-gram size based on our weights.
+    # Use the seeded `rng` (not the module-level `random`) so results stay
+    # reproducible.
+    sz = rng.choices(range(1, max_ngram_size+1),
+                     cum_weights=cummulative_weights)[0]
+
+    # Ensure this size doesn't result in too many masked tokens.
+    # E.g., a two-gram contains _at least_ two tokens.
+    if sum(masked_tokens) + sz > max_masked_tokens:
+      # All n-grams of this length are too long and can be removed from
+      # consideration.
+      ngrams[sz].clear()
+      continue
 
-def create_masked_lm_predictions(tokens, masked_lm_prob,
-                                 max_predictions_per_seq, vocab_words, rng,
-                                 do_whole_word_mask):
-  """Creates the predictions for the masked LM objective."""
+    # All of the n-grams of this size have been used.
+    if not ngrams[sz]:
+      continue
+
+    # Choose a random n-gram of the given size.
+    gram = ngrams[sz].pop()
+    num_gram_tokens = gram.end-gram.begin
+
+    # Check if this would add too many tokens.
+    if num_gram_tokens + sum(masked_tokens) > max_masked_tokens:
+      continue
+
+    # Check if any of the tokens in this gram have already been masked.
+    if sum(masked_tokens[gram.begin:gram.end]):
+      continue
 
-  cand_indexes = []
-  for (i, token) in enumerate(tokens):
-    if token == "[CLS]" or token == "[SEP]":
+    # Found a usable n-gram! Mark its tokens as masked and add it to return.
+    masked_tokens[gram.begin:gram.end] = [True] * (gram.end-gram.begin)
+    output_ngrams.append(gram)
+  return output_ngrams
+
+
+def _wordpieces_to_grams(tokens):
+  """Reconstitute grams (words) from `tokens`.
+
+  E.g.,
+    tokens: ['[CLS]', 'That', 'lit', '##tle', 'blue', 'tru', '##ck', '[SEP]']
+    grams:  [          [1,2),  [2,        4),  [4,5),  [5,       6)]
+
+  Arguments:
+    tokens: list of wordpieces
+  Returns:
+    List of _Grams representing spans of whole words
+    (without "[CLS]" and "[SEP]").
+  """
+  grams = []
+  gram_start_pos = None
+  for i, token in enumerate(tokens):
+    if gram_start_pos is not None and token.startswith("##"):
+      continue
-    # Whole Word Masking means that if we mask all of the wordpieces
-    # corresponding to an original word. When a word has been split into
-    # WordPieces, the first token does not have any marker and any subsequence
-    # tokens are prefixed with ##. So whenever we see the ## token, we
-    # append it to the previous set of word indexes.
-    #
-    # Note that Whole Word Masking does *not* change the training code
-    # at all -- we still predict each WordPiece independently, softmaxed
-    # over the entire vocabulary.
-    if (do_whole_word_mask and len(cand_indexes) >= 1 and
-        token.startswith("##")):
-      cand_indexes[-1].append(i)
+    if gram_start_pos is not None:
+      grams.append(_Gram(gram_start_pos, i))
+    if token not in ["[CLS]", "[SEP]"]:
+      gram_start_pos = i
     else:
-      cand_indexes.append([i])
+      gram_start_pos = None
+  if gram_start_pos is not None:
+    grams.append(_Gram(gram_start_pos, len(tokens)))
+  return grams
 
-  rng.shuffle(cand_indexes)
 
-  output_tokens = list(tokens)
+def create_masked_lm_predictions(tokens, masked_lm_prob,
+                                 max_predictions_per_seq, vocab_words, rng,
+                                 do_whole_word_mask,
+                                 max_ngram_size=None):
+  """Creates the predictions for the masked LM objective."""
+  if do_whole_word_mask:
+    grams = _wordpieces_to_grams(tokens)
+  else:
+    # Here we consider each token to be a word to allow for sub-word masking.
+    if max_ngram_size:
+      raise ValueError("cannot use ngram masking without whole word masking")
+    grams = [_Gram(i, i+1) for i in range(0, len(tokens))
+             if tokens[i] not in ["[CLS]", "[SEP]"]]
 
   num_to_predict = min(max_predictions_per_seq,
                        max(1, int(round(len(tokens) * masked_lm_prob))))
-
+  # Generate masks. If `max_ngram_size` is in [0, None] it means we're doing
+  # whole word masking or token level masking. Both of these can be treated
+  # as the `max_ngram_size=1` case.
+ masked_grams = _masking_ngrams(grams, max_ngram_size or 1, + num_to_predict, rng) masked_lms = [] - covered_indexes = set() - for index_set in cand_indexes: - if len(masked_lms) >= num_to_predict: - break - # If adding a whole-word mask would exceed the maximum number of - # predictions, then just skip this candidate. - if len(masked_lms) + len(index_set) > num_to_predict: - continue - is_any_index_covered = False - for index in index_set: - if index in covered_indexes: - is_any_index_covered = True - break - if is_any_index_covered: - continue - for index in index_set: - covered_indexes.add(index) - - masked_token = None - # 80% of the time, replace with [MASK] - if rng.random() < 0.8: - masked_token = "[MASK]" + output_tokens = list(tokens) + for gram in masked_grams: + # 80% of the time, replace all n-gram tokens with [MASK] + if rng.random() < 0.8: + replacement_action = lambda idx: "[MASK]" + else: + # 10% of the time, keep all the original n-gram tokens. + if rng.random() < 0.5: + replacement_action = lambda idx: tokens[idx] + # 10% of the time, replace each n-gram token with a random word. 
else: - # 10% of the time, keep original - if rng.random() < 0.5: - masked_token = tokens[index] - # 10% of the time, replace with random word - else: - masked_token = vocab_words[rng.randint(0, len(vocab_words) - 1)] + replacement_action = lambda idx: rng.choice(vocab_words) - output_tokens[index] = masked_token + for idx in range(gram.begin, gram.end): + output_tokens[idx] = replacement_action(idx) + masked_lms.append(MaskedLmInstance(index=idx, label=tokens[idx])) - masked_lms.append(MaskedLmInstance(index=index, label=tokens[index])) assert len(masked_lms) <= num_to_predict masked_lms = sorted(masked_lms, key=lambda x: x.index) @@ -467,7 +642,7 @@ def main(_): instances = create_training_instances( input_files, tokenizer, FLAGS.max_seq_length, FLAGS.dupe_factor, FLAGS.short_seq_prob, FLAGS.masked_lm_prob, FLAGS.max_predictions_per_seq, - rng, FLAGS.do_whole_word_mask) + rng, FLAGS.do_whole_word_mask, FLAGS.max_ngram_size) output_files = FLAGS.output_file.split(",") logging.info("*** Writing to output files ***") diff --git a/official/nlp/data/data_loader_factory.py b/official/nlp/data/data_loader_factory.py new file mode 100644 index 0000000000000000000000000000000000000000..a88caea67fe93f4b5166bb8bcf97841082fdd449 --- /dev/null +++ b/official/nlp/data/data_loader_factory.py @@ -0,0 +1,59 @@ +# Lint as: python3 +# Copyright 2020 The TensorFlow Authors. All Rights Reserved. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. 
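The n-gram masking added above samples a mask length with Zipf weights (1, 1/2, ..., 1/n), which is why the patch precomputes `itertools.accumulate(...)`: `random.Random.choices` accepts cumulative weights directly. A standalone sketch of just that sampling step (`pick_ngram_sizes` is an illustrative name, not part of the patch):

```python
import itertools
import random


def pick_ngram_sizes(max_ngram_size, num_draws, rng):
  """Samples n-gram lengths with Zipf weights 1, 1/2, ..., 1/max_ngram_size."""
  # Cumulative weights, in the form expected by choices(cum_weights=...).
  cum = list(itertools.accumulate(1. / n for n in range(1, max_ngram_size + 1)))
  return [rng.choices(range(1, max_ngram_size + 1), cum_weights=cum)[0]
          for _ in range(num_draws)]


rng = random.Random(12345)
sizes = pick_ngram_sizes(3, 1000, rng)
# With weights 1 : 1/2 : 1/3, expect roughly 55% / 27% / 18% of draws.
print({s: sizes.count(s) for s in (1, 2, 3)})
```

Using the seeded `random.Random` instance (rather than the module-level functions) keeps the draw reproducible, matching how `rng` is threaded through `create_training_instances`.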
+# ============================================================================== +"""A global factory to access NLP registered data loaders.""" + +from official.utils import registry + +_REGISTERED_DATA_LOADER_CLS = {} + + +def register_data_loader_cls(data_config_cls): + """Decorates a factory of DataLoader for lookup by a subclass of DataConfig. + + This decorator supports registration of data loaders as follows: + + ``` + @dataclasses.dataclass + class MyDataConfig(DataConfig): + # Add fields here. + pass + + @register_data_loader_cls(MyDataConfig) + class MyDataLoader: + # Inherits def __init__(self, data_config). + pass + + my_data_config = MyDataConfig() + + # Returns MyDataLoader(my_data_config). + my_loader = get_data_loader(my_data_config) + ``` + + Args: + data_config_cls: a subclass of DataConfig (*not* an instance + of DataConfig). + + Returns: + A callable for use as class decorator that registers the decorated class + for creation from an instance of data_config_cls. + """ + return registry.register(_REGISTERED_DATA_LOADER_CLS, data_config_cls) + + +def get_data_loader(data_config): + """Creates a data_loader from data_config.""" + return registry.lookup(_REGISTERED_DATA_LOADER_CLS, data_config.__class__)( + data_config) diff --git a/official/nlp/data/pretrain_dataloader.py b/official/nlp/data/pretrain_dataloader.py index 18325090caa6d83e68b4077aac4a27ee69bea938..985a7a5cc6c3f2e8a811d4fafbe6c731a1033f20 100644 --- a/official/nlp/data/pretrain_dataloader.py +++ b/official/nlp/data/pretrain_dataloader.py @@ -16,11 +16,27 @@ """Loads dataset for the BERT pretraining task.""" from typing import Mapping, Optional +import dataclasses import tensorflow as tf from official.core import input_reader +from official.modeling.hyperparams import config_definitions as cfg +from official.nlp.data import data_loader_factory +@dataclasses.dataclass +class BertPretrainDataConfig(cfg.DataConfig): + """Data config for BERT pretraining task (tasks/masked_lm).""" + 
input_path: str = '' + global_batch_size: int = 512 + is_training: bool = True + seq_length: int = 512 + max_predictions_per_seq: int = 76 + use_next_sentence_label: bool = True + use_position_id: bool = False + + +@data_loader_factory.register_data_loader_cls(BertPretrainDataConfig) class BertPretrainDataLoader: """A class to load dataset for bert pretraining task.""" @@ -91,7 +107,5 @@ class BertPretrainDataLoader: def load(self, input_context: Optional[tf.distribute.InputContext] = None): """Returns a tf.dataset.Dataset.""" reader = input_reader.InputReader( - params=self._params, - decoder_fn=self._decode, - parser_fn=self._parse) + params=self._params, decoder_fn=self._decode, parser_fn=self._parse) return reader.read(input_context) diff --git a/official/nlp/data/question_answering_dataloader.py b/official/nlp/data/question_answering_dataloader.py new file mode 100644 index 0000000000000000000000000000000000000000..08c7047e4afd80999899c34f2c5855ad2ef18634 --- /dev/null +++ b/official/nlp/data/question_answering_dataloader.py @@ -0,0 +1,95 @@ +# Lint as: python3 +# Copyright 2020 The TensorFlow Authors. All Rights Reserved. +# +# Licensed under the Apache License, Version 2.0 (the "License"); +# you may not use this file except in compliance with the License. +# You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. 
+# ==============================================================================
+"""Loads dataset for the question answering (e.g., SQuAD) task."""
+from typing import Mapping, Optional
+import dataclasses
+import tensorflow as tf
+
+from official.core import input_reader
+from official.modeling.hyperparams import config_definitions as cfg
+from official.nlp.data import data_loader_factory
+
+
+@dataclasses.dataclass
+class QADataConfig(cfg.DataConfig):
+  """Data config for question answering task (tasks/question_answering)."""
+  input_path: str = ''
+  global_batch_size: int = 48
+  is_training: bool = True
+  seq_length: int = 384
+  # Settings below are question answering specific.
+  version_2_with_negative: bool = False
+  # Settings below are only used for eval mode.
+  input_preprocessed_data_path: str = ''
+  doc_stride: int = 128
+  query_length: int = 64
+  vocab_file: str = ''
+  tokenization: str = 'WordPiece'  # WordPiece or SentencePiece
+  do_lower_case: bool = True
+
+
+@data_loader_factory.register_data_loader_cls(QADataConfig)
+class QuestionAnsweringDataLoader:
+  """A class to load dataset for the question answering (e.g., SQuAD) task."""
+
+  def __init__(self, params):
+    self._params = params
+    self._seq_length = params.seq_length
+    self._is_training = params.is_training
+
+  def _decode(self, record: tf.Tensor):
+    """Decodes a serialized tf.Example."""
+    name_to_features = {
+        'input_ids': tf.io.FixedLenFeature([self._seq_length], tf.int64),
+        'input_mask': tf.io.FixedLenFeature([self._seq_length], tf.int64),
+        'segment_ids': tf.io.FixedLenFeature([self._seq_length], tf.int64),
+    }
+    if self._is_training:
+      name_to_features['start_positions'] = tf.io.FixedLenFeature([], tf.int64)
+      name_to_features['end_positions'] = tf.io.FixedLenFeature([], tf.int64)
+    else:
+      name_to_features['unique_ids'] = tf.io.FixedLenFeature([], tf.int64)
+    example = tf.io.parse_single_example(record, name_to_features)
+
+    # tf.Example only supports tf.int64, but the TPU only supports tf.int32.
+    # So cast all int64 to int32.
+    for name in example:
+      t = example[name]
+      if t.dtype == tf.int64:
+        t = tf.cast(t, tf.int32)
+      example[name] = t
+
+    return example
+
+  def _parse(self, record: Mapping[str, tf.Tensor]):
+    """Parses raw tensors into a dict of tensors to be consumed by the model."""
+    x, y = {}, {}
+    for name, tensor in record.items():
+      if name in ('start_positions', 'end_positions'):
+        y[name] = tensor
+      elif name == 'input_ids':
+        x['input_word_ids'] = tensor
+      elif name == 'segment_ids':
+        x['input_type_ids'] = tensor
+      else:
+        x[name] = tensor
+    return (x, y)
+
+  def load(self, input_context: Optional[tf.distribute.InputContext] = None):
+    """Returns a tf.data.Dataset."""
+    reader = input_reader.InputReader(
+        params=self._params, decoder_fn=self._decode, parser_fn=self._parse)
+    return reader.read(input_context)
diff --git a/official/nlp/data/sentence_prediction_dataloader.py b/official/nlp/data/sentence_prediction_dataloader.py
index 60dd788403725aeeca2028b237c3330bbf22716c..57c068c8654ae363dcc50b081cac69d8cdb2536c 100644
--- a/official/nlp/data/sentence_prediction_dataloader.py
+++ b/official/nlp/data/sentence_prediction_dataloader.py
@@ -15,11 +15,28 @@
 # ==============================================================================
 """Loads dataset for the sentence prediction (classification) task."""
 from typing import Mapping, Optional
+import dataclasses
 
 import tensorflow as tf
 
 from official.core import input_reader
+from official.modeling.hyperparams import config_definitions as cfg
+from official.nlp.data import data_loader_factory
 
+LABEL_TYPES_MAP = {'int': tf.int64, 'float': tf.float32}
+
+
+@dataclasses.dataclass
+class SentencePredictionDataConfig(cfg.DataConfig):
+  """Data config for sentence prediction task (tasks/sentence_prediction)."""
+  input_path: str = ''
+  global_batch_size: int = 32
+  is_training: bool = True
+  seq_length: int = 128
+  label_type: str = 'int'
+
+
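The `data_loader_factory` introduced earlier in this patch pairs each `DataConfig` subclass with one loader class, so `get_data_loader(config)` can dispatch on the config's type. The real code delegates to `official.utils.registry`; the following is a self-contained sketch of the same pattern, with all names (`register_loader`, `MyDataConfig`, etc.) being illustrative:

```python
# Maps config class -> loader class; a stand-in for the factory's registry.
_LOADERS = {}


def register_loader(config_cls):
  """Decorator mapping a config class to the loader class that consumes it."""

  def decorator(loader_cls):
    _LOADERS[config_cls] = loader_cls
    return loader_cls

  return decorator


def get_loader(config):
  """Instantiates the loader registered for type(config)."""
  return _LOADERS[type(config)](config)


class MyDataConfig:
  seq_length = 128


@register_loader(MyDataConfig)
class MyDataLoader:

  def __init__(self, config):
    self.config = config


loader = get_loader(MyDataConfig())
print(type(loader).__name__)  # MyDataLoader
```

This is why each dataloader file in the patch defines a dataclass config next to its loader and decorates the loader with `register_data_loader_cls(...)`: construction is driven entirely by the config object's type.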
+@data_loader_factory.register_data_loader_cls(SentencePredictionDataConfig)
 class SentencePredictionDataLoader:
   """A class to load dataset for sentence prediction (classification) task."""
@@ -29,11 +46,12 @@ class SentencePredictionDataLoader:
 
   def _decode(self, record: tf.Tensor):
     """Decodes a serialized tf.Example."""
+    label_type = LABEL_TYPES_MAP[self._params.label_type]
     name_to_features = {
         'input_ids': tf.io.FixedLenFeature([self._seq_length], tf.int64),
         'input_mask': tf.io.FixedLenFeature([self._seq_length], tf.int64),
         'segment_ids': tf.io.FixedLenFeature([self._seq_length], tf.int64),
-        'label_ids': tf.io.FixedLenFeature([], tf.int64),
+        'label_ids': tf.io.FixedLenFeature([], label_type),
     }
     example = tf.io.parse_single_example(record, name_to_features)
diff --git a/official/nlp/data/sentence_retrieval_lib.py b/official/nlp/data/sentence_retrieval_lib.py
new file mode 100644
index 0000000000000000000000000000000000000000..d8e83ae579f8221b93e790ea62b91c3d6d2b9e90
--- /dev/null
+++ b/official/nlp/data/sentence_retrieval_lib.py
@@ -0,0 +1,168 @@
+# Copyright 2020 The TensorFlow Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ==============================================================================
+"""BERT library to process data for the cross-lingual sentence retrieval task."""
+
+import os
+
+from absl import logging
+from official.nlp.bert import tokenization
+from official.nlp.data import classifier_data_lib
+
+
+class BuccProcessor(classifier_data_lib.DataProcessor):
+  """Processor for the Xtreme BUCC data set."""
+  supported_languages = ["de", "fr", "ru", "zh"]
+
+  def __init__(self,
+               process_text_fn=tokenization.convert_to_unicode):
+    super(BuccProcessor, self).__init__(process_text_fn)
+    self.languages = BuccProcessor.supported_languages
+
+  def get_dev_examples(self, data_dir, file_pattern):
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, file_pattern.format("dev"))),
+        "sample")
+
+  def get_test_examples(self, data_dir, file_pattern):
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, file_pattern.format("test"))),
+        "test")
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "BUCC"
+
+  def _create_examples(self, lines, set_type):
+    """Creates examples for the dev and test sets."""
+    examples = []
+    for (i, line) in enumerate(lines):
+      guid = "%s-%s" % (set_type, i)
+      int_iden = int(line[0].split("-")[1])
+      text_a = self.process_text_fn(line[1])
+      examples.append(
+          classifier_data_lib.InputExample(
+              guid=guid, text_a=text_a, int_iden=int_iden))
+    return examples
+
+
+class TatoebaProcessor(classifier_data_lib.DataProcessor):
+  """Processor for the Xtreme Tatoeba data set."""
+  supported_languages = [
+      "af", "ar", "bg", "bn", "de", "el", "es", "et", "eu", "fa", "fi", "fr",
+      "he", "hi", "hu", "id", "it", "ja", "jv", "ka", "kk", "ko", "ml", "mr",
+      "nl", "pt", "ru", "sw", "ta", "te", "th", "tl", "tr", "ur", "vi", "zh"
+  ]
+
+  def __init__(self,
+               process_text_fn=tokenization.convert_to_unicode):
+    super(TatoebaProcessor, self).__init__(process_text_fn)
+    self.languages = TatoebaProcessor.supported_languages
+
+  def get_test_examples(self, data_dir, file_path):
+    return self._create_examples(
+        self._read_tsv(os.path.join(data_dir, file_path)), "test")
+
+  @staticmethod
+  def get_processor_name():
+    """See base class."""
+    return "TATOEBA"
+
+  def _create_examples(self, lines, set_type):
+    """Creates examples for the test set."""
+    examples = []
+    for (i, line) in enumerate(lines):
+      guid = "%s-%s" % (set_type, i)
+      text_a = self.process_text_fn(line[0])
+      examples.append(
+          classifier_data_lib.InputExample(
+              guid=guid, text_a=text_a, int_iden=i))
+    return examples
+
+
+def generate_sentence_retrieval_tf_record(processor,
+                                          data_dir,
+                                          tokenizer,
+                                          eval_data_output_path=None,
+                                          test_data_output_path=None,
+                                          max_seq_length=128):
+  """Generates the tf records for retrieval tasks.
+
+  Args:
+    processor: Input processor object to be used for generating data. Subclass
+      of `DataProcessor`.
+    data_dir: Directory that contains train/eval data to process. Data files
+      should follow the naming pattern expected by the processor
+      (e.g. `{lang}-en.{split}`).
+    tokenizer: The tokenizer to be applied on the data.
+    eval_data_output_path: Output to which processed tf record for evaluation
+      will be saved.
+    test_data_output_path: Output to which processed tf record for testing
+      will be saved. Must be a pattern template with {} if processor has
+      language specific test data.
+    max_seq_length: Maximum sequence length of the training/eval data to be
+      generated.
+
+  Returns:
+    A dictionary containing input meta data.
+ """ + assert eval_data_output_path or test_data_output_path + + if processor.get_processor_name() == "BUCC": + path_pattern = "{}-en.{{}}.{}" + + if processor.get_processor_name() == "TATOEBA": + path_pattern = "{}-en.{}" + + meta_data = { + "processor_type": processor.get_processor_name(), + "max_seq_length": max_seq_length, + "number_eval_data": {}, + "number_test_data": {}, + } + logging.info("Start to process %s task data", processor.get_processor_name()) + + for lang_a in processor.languages: + for lang_b in [lang_a, "en"]: + if eval_data_output_path: + eval_input_data_examples = processor.get_dev_examples( + data_dir, os.path.join(path_pattern.format(lang_a, lang_b))) + + num_eval_data = len(eval_input_data_examples) + logging.info("Processing %d dev examples of %s-en.%s", num_eval_data, + lang_a, lang_b) + output_file = os.path.join( + eval_data_output_path, + "{}-en-{}.{}.tfrecords".format(lang_a, lang_b, "dev")) + classifier_data_lib.file_based_convert_examples_to_features( + eval_input_data_examples, None, max_seq_length, tokenizer, + output_file, None) + meta_data["number_eval_data"][f"{lang_a}-en.{lang_b}"] = num_eval_data + + if test_data_output_path: + test_input_data_examples = processor.get_test_examples( + data_dir, os.path.join(path_pattern.format(lang_a, lang_b))) + + num_test_data = len(test_input_data_examples) + logging.info("Processing %d test examples of %s-en.%s", num_test_data, + lang_a, lang_b) + output_file = os.path.join( + test_data_output_path, + "{}-en-{}.{}.tfrecords".format(lang_a, lang_b, "test")) + classifier_data_lib.file_based_convert_examples_to_features( + test_input_data_examples, None, max_seq_length, tokenizer, + output_file, None) + meta_data["number_test_data"][f"{lang_a}-en.{lang_b}"] = num_test_data + + return meta_data diff --git a/official/nlp/data/tagging_data_lib.py b/official/nlp/data/tagging_data_lib.py new file mode 100644 index 0000000000000000000000000000000000000000..c97fd9382f493209f61b0672c04b544259164372 
--- /dev/null
+++ b/official/nlp/data/tagging_data_lib.py
@@ -0,0 +1,346 @@
+# Copyright 2020 The TensorFlow Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ==============================================================================
+"""Library to process data for tagging tasks such as NER/POS."""
+import collections
+import os
+
+from absl import logging
+import tensorflow as tf
+
+from official.nlp.data import classifier_data_lib
+
+# A negative label id for the padding label, which will not contribute
+# to loss/metrics in training.
+_PADDING_LABEL_ID = -1
+
+# The special unknown token, used to substitute a word which has too many
+# subwords after tokenization.
+_UNK_TOKEN = "[UNK]"
+
+
+class InputExample(object):
+  """A single training/test example for token classification."""
+
+  def __init__(self, sentence_id, words=None, label_ids=None):
+    """Constructs an InputExample."""
+    self.sentence_id = sentence_id
+    self.words = words if words else []
+    self.label_ids = label_ids if label_ids else []
+
+  def add_word_and_label_id(self, word, label_id):
+    """Adds a word and label_id pair to the example."""
+    self.words.append(word)
+    self.label_ids.append(label_id)
+
+
+def _read_one_file(file_name, label_list):
+  """Reads one file and returns a list of `InputExample` instances."""
+  lines = tf.io.gfile.GFile(file_name, "r").readlines()
+  examples = []
+  label_id_map = {label: i for i, label in enumerate(label_list)}
+  sentence_id = 0
+  example = InputExample(sentence_id=0)
+  for line in lines:
+    line = line.strip("\n")
+    if line:
+      # The format is: <token>\t<label>